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Preface 


This book is divided into two parts. 

The first one is purely algebraic. Its objective is the classification of 
quadratic forms over the field of rational numbers (Hasse-Minkowski 
theorem). It is achieved in Chapter IV. The first three chapters contain some 
preliminaries: quadratic reciprocity law, p-adic fields, Hilbert symbols. 
Chapter V applies the preceding results to integral quadratic forms of 
discriminant + 1. These forms occur in various questions: modular functions, 
differential topology, finite groups. 

The second part (Chapters VI and VII) uses “‘analytic”’ methods (holomor- 
phic functions). Chapter VI gives the proof of the “‘theorem on arithmetic 
progressions” due to Dirichlet; this theorem is used at a critical point in the 
first part (Chapter III, no. 2.2). Chapter VII deals with modular forms, 
and in particular, with theta functions. Some of the quadratic forms of 
Chapter V reappear here. 

The two parts correspond to lectures given in 1962 and 1964 to second 
year students at the Ecole Normale Supérieure. A redaction of these lectures 
in the form of duplicated notes, was made by J.-J. Sansuc (Chapters I-IV) 
and J.-P. Ramis and G. Ruget (Chapters VI-VII). They were very useful to 
me; I extend here my gratitude to their authors. 


J.-P. Serre 
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A Course in Arithmetic 


Part I 


Algebraic Methods 


Chapter I 


Finite Fields 
All fields considered below are supposed commutative. 


§1. Generalities 
1.1. Finite fields 


Let K be a field. The image of Z in K is an integral domain, hence 
isomorphic to Z or to Z/pZ, where p is prime; its field of fractions is iso- 
morphic to Q or to Z/pZ = F,. In the first case, one says that K is of 
characteristic zero; in the second case, that K is of characteristic p. 

The characteristic of K is denoted by char(K). If char(K) = p + 0, pis 
also the smallest integer 2 >0 such that x.1 = 0. 


Lemma.—/f char(K) = p, the map a: x ++ x? is an isomorphism of K onto 
one of its subfields K?. 

We have o(xy) = o(x)o(y). Moreover, the binomial coefficient (7) iS 
congruent to 0 (mod p) if 0<k<p. From this it follows that 


o(x+y) = o(x)+o(y); 
hence o is a homomorphism. Furthermore, o is clearly injective. 


Theorem 1.—i) The characteristic of a finite field K is a prime number 
p + 0; if f = [K:F,], the number of elements of K is q = p’. 

ii) Let p be a prime number and let q = p!(f = 1) be a power of p. Let 
Q be an algebraically closed field of characteristic p. There exists a unique 
subfield F,, of Q which has q elements. It is the set of roots of the polynomial 
X4— X, 

iii) All finite fieids with q = p! elements are isomorphic to F,. 

If K is finite, it does not contain the field Q. Hence its characteristic is a 
prime number p. If f is the degree of the extension K/F,, it is clear that 
Card(K) = p’, and i) follows. 

On the other hand, if © is algebraically closed of characteristic p, the 
above lemma shows that the map x++ x? (where g = p’, f2 1) is an 
automorphism of 2; indeed, this map is the f—th iterate of the automorphism 
ao: x ++ x? (note that o is surjective since Q is algebraically closed). Therefore, 
the elements x € Q invariant by x ++ x‘ form a subfield F, of 2. The derivative 
of the polynomial X4— YX is 


gX4-'—| = ppl Xt 1-1 = -1 
3 
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and is not zero. This implies (since {2 is algebraically closed) that ¥4—¥X 
has q distinct roots, hence Card(F,) = g. Conversely, if K is a subfield of 2 
with q elements, the multiplicative group K* of nonzero elements in K has 
q—1 elements. Then x*~! = | if xe K* and x? = x if xe K. This proves 
that K is contained in F,. Since Card(K) = Card(F,) we have K = F, which 
completes the proof of ii). 

Assertion iii) follows from ii) and from the fact that all fields with p/ 
elements can be embedded in Q since 2 is algebraically closed. 


1.2. The multiplicative group of a finite field 


Let p be a prime number, let / be an integer =1, and let g = p’. 


Theorem 2.—The multiplicative group Ff} of a finite field F, is cyclic of 
order q—1. 


Proof. \f dis an integer 2 I, recall that ¢(d) denotes the Euler ¢-function, 
i.e. the number of integers x with | < x < d which are prime to d (in other 
words, whose image in Z/dZ is a generator of this group). It is clear that the 
number of generators of a cyclic group of order d is ¢(d). 


Lemma 1.—/fn is an integer 21, thenn = 2 $d). (Recall that the nota- 
tion d|n means that d divides n). 

If d divides n, let C, be the unique subgroup of Z/nZ of order d, and 
let ©, be the set of generators of C,. Since all elements of Z/nZ generate 
one of the C,, the group Z/nZ is the disjoint union of the ®, and we have 


n = Card(Z/nZ) = de Card(®,) = d, d(d). 


Lemma 2.—Let H be a finite group of order n. Suppose that, for all divisors 
d of n, the set of x ¢ H such that x‘ = | has at most d elements. Then H is 
cyclic. 

Let d be a divisor of n. If there exists x e H of order d, the subgroup 
(x) = {1, x,..., x47 '} generated by x is cyclic of order d; in view of the 
hypothesis, all elements y « H such that y* = 1 belong to (x). In particular, 
all elements of H of order d are generators of (x) and these are in number 
¢(d). Hence, the number of elements of H of order d is 0 or ¢(d). If it were 
zero for a value of d, the formula n = 2 4(4) would show that the number 


of elements in /{ is <n, contrary to hypothesis. In particular, there exists an 
element xe H of order n and H coincides with the cyclic group (x). 


Theorem 2 follows from lemma 2 applied to H = Ff and n =q—1; 
it is indeed obvious that the equation x4 = 1, which has degree d, has at 
most d solutions in F,. 


Remark. The above proof shows more generally that all finite subgroups 
of the multiplicative group of a field are cyclic. 
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§2. Equations over a finite field 
Let g be a power of a prime number p, and let K bea field with g elements. 
2.1. Power sums 


Lemma.—Let u be an integer =O. The sum S(X“) = 2 x" is equal to —| 
xe 


ifuis 21 and divisible by q—1; it is equal to 0 otherwise. 

(We agree that x“ = 1 if u = 0 even if x = 0.) 

If « = 0, all the terms of the sum are equal to 1; hence S(X“) = q.1 = 0 
because K 1s of characteristic p. 

If wis 21 and divisible by g—1, we have 0° = 0 and x” = 1 if x + 0. 
Hence S(X") = (q—1).1 = —-1. 

Finally, if wis 21 and not divisible by g—1, the fact that K* is cyclic 
of order g— 1(th. 2) shows that there exists y e K* such that y” + 1. One has: 
S(X") _ Y yt = y ytx" _— y"S(X") 

xeK* xeK* 
and (1 — y*)S(X") = 0 which implies that SCX") = 0. 
(Variant—Use the fact that, if d 2 2 is prime to p, the sum of the d—th 
roots of unity is zero.) 


2.2. Chevalley theorem 


Theorem 3 (Chevalley —Warning).—Let f,¢ K[X,,...,X,] be poly- 
nomials in n variables such that X deg f, <n, and let V be the set of their 
common zeros in K". One has 


Card(V) = 0 (mod p). 
Put P = [](1-/27') and let xe K”.If xe V, all the /,(x) are zero and 


P(x) = 1; if x¢€V, one of the f,(x) is nonzero and f,(x)'~! = 1, hence 
P(x) = 0. Thus P is the characteristic function of V. If, for every polynomial 
f, we put S(f) = = f(x), we have 

xeK" 


Card(V) = S(P) (mod p) 


and we are reduced to showing that S(P) = 0. 

Now the hypothesis & deg f, < 1 implies that deg P < n(qg—1); thus P 
is a linear combination of monomials X" = Xj... X¥* with Xu, < n(g—1). 
It suffices to prove that, for such a monomial X“, we have S(X“) = 0, and 
this follows from the lemma since at least one u; is <q—1. 


Corollary 1.— If % deg f, <n and if the f, have no constant term, then the f, 
have a nontrivial common zero. 

Indeed, if V were reduced to {0}, Card(V) would not be divisible by p. 

Corollary 1 applies notably when the /, are homogeneous. In particular: 
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Corollary 2.—A/l quadratic forms in at least 3 variables over K have a 
non trivial zero. 

(In geometric language: every conic over a finite field has a rational 
point.) 


§3. Quadratic reciprocity law 
3.1. Squares in F, 


Let g be a power of a prime number p. 


Theorem 4.--(a) Jf p = 2, then all elements of F, are squares. 

(b) Uf p + 2, then the squares of FZ form a subgroup of index 2 in FT; 
this subgroup is the kernel of the homomorphism x +> x“@~')!? with values 
in {+1}. 

(In other terms, one has an exact sequence: 


| > F¥? + F*¥ > {+1} > 1) 


Case (a) follows from the fact that x+» x? is an automorphism of F,. 
In case (b), let 2 be an algebraic closure of F,; if xe FY, let ye Q be 
such that y? = x. We have: 


yh = xGQ7D/? = +] since x?7! = 1. 


For x to be a square in F, it is necessary and sufficient that y belongs to FY, 
ie. yt! = 1. Hence F*? is the kernel of x++ x“~'”/?, Moreover, since F¥ 
is cyclic of order g—1, the index of F*? is equal to 2. 


3.2. Legendre symbol (elementary case) 


Definition.—Let p be a prime number +2, and let xe F*. The Legendre 

symbol of x, denoted by (*). is the integer x?~'/? = +1}, 
Pp 

0 


-- | = 0. Moreover, 
p 


It is convenient to extend (*) to all of F, by putting ( 
| x x’ 
if x e Z has for image x’ < F,, one writes (=) = (=) 
We have (*) (”) = (**) The Legendre symbol is a “‘character”’ (cf. 
P} \P P 
chap. VI, §1). As seen in theorem 4, *) = | is equivalent to x e F*?; if 


; . x _ 
x € F* has y as a square root in an algebraic closure of F,, then (*) = yPoh 
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Computation of (*) for x = 1, -—1,2: 
If n is an odd integer, let e(n) and w(n) be the elements of Z/2Z defined by: 


on) = "= (mod 2) = {0 if” = 1 (mod 4) 
y | ifn = —1 (mod 4) 


HI 


2_ 7 
w(n) =" | (mod 2) = Oifn = +1 (mod 8) 
8 lifn = +5(mod 8) 


[The function « is a homomorphism of the multiplicative group (Z/4Z)* 
onto Z/2Z; similarly, w is a homomorphism of (Z/8Z)* onto Z/2Z.] 


Theorem 5.—The following formulas hold: 


Only the last deserves a proof. If « denotes a primitive 8th root of unity 
in an algebraic closure Q of F,, the element y = «+a ‘ verifies y? = 2 
—1 it follows that «7+a~? = 0). We have 


(from a* = 
y? = arta ?. 


If p = +1 (mod 8), this implies y’ = y, thus (5) =y'= ]Iifp= +5 


(mod 8), one finds y? = «°+a~° = —(a+«~') = —y. (This again follows 


from «* = —1.) We deduce from this that y?~! = —1, whence iii) follows. 


Remark. Theorem 5 can be expressed in the following way: 
— 1 is a square (mod p) if and only if p = 1 (mod 4). 
2 is a square (mod p) if and only if p = +1 (mod 8). 


3.3. Quadratic reciprocity law 


Let / and p be two distinct prime numbers different from 2. 


Theorem 6 (Gauss).— (") = (? (— 1) DCP), 


Let 2 be an algebraic closure of F,, and let we be a primitive /-th 
root of unity. If xe F,, the element w” is well defined since w! = 1. Thus 


we are able to form the ‘‘Gauss sum”’: 


y= > (7) wr. 
xéF, l 


Lemma 1.—y? = (— 1) 7. 
(By abuse of notation / denotes also the image of / in the field F,.) 
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We have 
XZ\ 4; 4 tu—t 
y= (“wer = yw o(S—)) 
X,z | ueF; (eF, l 
Now if ¢ + 0: 


t(u-)\ _ (-@\fl-ut ty) yen (Leu! 
5) =F) oer(EF). 


(-1)?y? = Y Cw’, 


and 


where 


Ifu=0, Co = D @ = /—1; otherwise s = |—ut~' runs over F,—{1}, 
1€F? 


«= 3) (= -(7)- 


since in F* there are as many squares as non squares. Hence 2 G wi = 
/—1— &% w* =/, which proves the lemma. 


ucF; 


Lemma 2.—y?~' = (7) 


Since 2 is of characteristic p, we have 


and we have 


Theorem 6 is now immediate. Indeed, by lemmas | and 2, 


(men) =r =() 
p l 


and the second part of th. 5 proves that 


(oO) = (— 1) ete), 
Pp 


Translation.—Write /Rp if ] is a square (mod p) (that is to say, if / is a 
‘quadratic residue’’ modulo p) and [Np otherwise. Theorem 6 means that 


‘ 


[Rp <>pRlI ifp or 1=1 = (mod 4) 
IRp <> pNI if pand/ = —1 (mod 4). 
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Remark. Theorem 6 can be used to compute Legendre symbols by 
successive reductions. Thus: 


)=(8)-(8)-(3)8)--)--@)--0)- 


Another proof of the quadratic reciprocity law (G. EISENSTEIN, J. Crelle, 
29, 1845, pp. 177-184.) 


1) Gauss Lemma 


Let p be a prime number +2, and let S be a subset of F¥ such that F? is 


the disjoint union of Sand — S. In the following we take S -{ lL... 


a) 
howl 
I+ =" 


If se Sand a « F¥, we write as in the form as = e,(a)s, with e,(a) I 


and s,¢S. 


seS 
Remark first that, if s and s’ are two distinct elements of S, then s, + s, 
(for otherwise s = +5’ contrary to the choice of S). This shows that sr 5, 
is a bijection of S onto itself. Multiplying the equalities as = e,(a)s,, we 


obtain 
gQP-O2 TT s = (I] e.(a)) I] 5, = (I] ea)) I] 5, 


seS seS seS 


Lemma (Gauss) —{ 7\ [| e,(@). 
p 


hence 
gh IW2 Il e.(@); 


se§ 


, a ~1y/2 ; 
this proves the lemma since (° = a?" in F 
P 


Example.—Take a = 2 and S = ! a ot. We have e(2) = 1 if 


2s < pal and e,(2) = —1 otherwise. From this we get @ = (—1)") 


2 
; , p-\ p-\ , 
where n(p) is the number of integers s such that a <ss > If p is 


of the form 4k + | (resp. 4k — 1), then n(p) = k. Thus we recover the fact 
that @ = lifp = +1 (mod 8) and () = —lifp = +5(mod 8), cf. th. 5. 
p Pp 


1) A trigonometric lemma 


Lemma.—Let m be a positive odd integer. One has 
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sin mx _ . . 5 2a] 
= (-4ye DP il sin? x—sin? — ) 

sin x 1<js(m—-1)/2 m 


This is elementary (for instance, prove first that sin (mx)/sin (x) is a poly- 
nomial of degree (m—1)/2 in sin? x, then remark that this polynomial has 


for roots the sin? on) with | <j S$ (m—1)/2; the factor (—4)'""!/? is 
m 


obtained by comparing coefficients of e"~ '* on both sides). 
iii) Proof of the quadratic reciprocity law 


Let / and p be two distinct prime numbers different from 2. Let 


S={1,...,(p—1)/2} 


as above. From Gauss’ lemma, we get 


() = Deo 
P seS 


Now the equality /s = e,(/)s, shows that 


sin on ls = e,(/) sin on S). 
P 


Multiplying these equalities, and taking into account that sts, is a bi- 
jection, we get: 

6, = [Je(l) = [] sin 2 / sin 275. 

seS seS Dp P 

By applying the trigonometric lemma with m = /, we can rewrite this: 


6 = [](-4y [] (sin ~ sin? *) 
Pp 


P seS eT 


= (-4) 000% TT] (sine 2 — sin? *). 
P 


seS,teT l 


where T denotes the set of integers between | and (/—1)/2. Permuting the 
roles of / and p, we obtain similarly: 


PY = (-ayt-pe-ms TY (sin? 22 — sin? 25) 
l seS, teT [ Dp 


The factors giving ( id and (7) are identical up to sign. Since there are 
p 
(p—1) (/—1)/4 of these, we find: 


6 = (7) [Je- DUH D/4, 
p l 


This is the quadratic reciprocity law, cf. th. 6. 


Chapter I 


p-Adic Fields 
In this chapter p denotes a prime number. 


§1. The ring Z, and the field Q, 


1.1. Definitions 


For every n = 1, let A, = Z/p"Z; it is the ring of classes of integers 


(mod p"). An element of A, defines in an obvious way an element of A, _,; 
we thus obtain a homomorphism 


pn: A, > A,-1) 


which is surjective and whose kernel is p"~ 'A,,. 
The sequence 
>A, — A,-1 > -.- 7A > A, 


forms a “‘projective system” indexed by the integers 2 1. 


Definition 1.—The ring of p-adic integers Z, is the projective limit of the 
system (A,, ¢,) defined above. 

By definition, an element of Z, = lim (A,, $n) 1S a Sequence x = 
(...,Xq_---,X,) with x, eA, and ¢,(x,) = x,-, If nm 2 2. Addition and 
multiplication in Z, are defined “coordinate by coordinate’. In other words, 
Z, is a subring of the product Il 4,. If we give A, the discrete topology and 


Il A, the product topology, the ring Z, inherits a topology which turns it 
into a compact space (since it is closed in a product of compact spaces). 


1.2. Properties of Z, 


Let «,: Z, — A, be the function which associates to a p-adic integer x its 
n-th component x,. 


Proposition 1.—7he sequence 0 > Z, ’, Z, 3A » — 0 is an exact sequence 
of abelian groups. 
(Thus we can identify Z,/p"Z, with A, = Z/p"Z.) 

Multiplication by p (hence also by p") is injective in Z,; indeed, if 
x = (x,) iS a p-adic integer such that px = 0, we have px,,, = 0 for all n, 
and x,4, is of the form p"y,., with y,41; €Age,3 since x, = $,4;(X%,41), we 
see that x, is also divisible by p", hence, is zero. 


Il 
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It is clear that the kernel of «, contains p"Z,; conversely, if x = (x,,) 
belongs to ker(e,,), one has x,, = 0 (mod p") for all m 2 n which means that 
there exists a well defined element y,,_, of A,,-, Such that its image under 
the isomorphism A,,_, > p"Z/p"Z © A,, Satisfies x, = P"Ymn-n- The y; define 
an element y of Z, = lim.A,;, and one checks immediately that p"y = x, 


which proves the proposition. 


Proposition 2.—(a) For an element of Z, (resp. of A,) to be invertible it 
is necessary and sufficient that it is not divisible by p. 

(b) If U denotes the group of invertible elements of Z,, every nonzero 
element of Z, can be written uniquely in the form p"u with ue U and n 2 0. 
(An element of U is called a p-adic unit.) 

It suffices to prove (a) for A,; the case of Z, will follow. Now, if x € A, 
does not belong to pA,, its image in A, = F, is not zero, thus invertible: 
hence there exists y, ze A, such that xy = 1 —pz, hence 


xyitpzt+... +p" '2"-') = 1, 


which proves that x is invertible. 

On the other hand, if x « Z, is not zero, there exists a largest integer n 
such that x, = «,(x) Is zero; then x = p"u with uw not divisible by p, hence 
ueéU by (a). The uniqueness of the decomposition is clear. 


Notation.—Let x be a nonzero element of Z,; write x in the form p"u 
with ue U. The integer 7 is called the p-adic valuation of x and denoted by 
v,(x). We put v,(0) = +0 and we have 


v(xy) = v,(x)+0,(y), v,(xt+y) 2 inf (v,(>), »,(y) 
It follows easily from these formulas that Z, is an integral domain. 
Proposition 3.—The topology on Z, can be defined by the distance 
d(x, y) = e Poy), 


The ring Z, is a complete metric space in which Z is dense. 

The ideals p"Z, form a basis of neighborhoods of 0; since x « p"Z, is 
equivalent to v,(x) 2 n, the topology on Z, is defined by the distance 
d(x, y) = e~’?="»”, Since Z, is compact, it is complete. Finally, if x = (x,) 
is an element of Z,, and if y, ¢ Z is such that y, = x, (mod p"), then lim.y, = 
x, which proves that Z is dense in Z,. 


1.3. The field Q, 


Definition 2.—The field of p-adic numbers, denoted by Q,, is the field of 
fractions of the ring Z,. 

One sees immediately that Q, = Z,[p~')]. Every element x of Q> can be 
written uniquely in the form p"u with n e Z, ue U; here again, » is called the 
p-adic valuation of x and is denoted by v,(x). One has v,(x) 2 0 if and only 
if x e Z,. 
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Proposition 4.—The field Q,, with the topology defined by d(x, y) = 
e~ *®*—Y), is locally compact, and contains Z, as an open subring; the field Q 
is dense in Q,. 

This is clear. 


Remarks.—1) We could have defined Q, (resp. Z,) as the completion 
of Q (resp. Z) for the p-adic distance d. 
2) The distance d satisfies the “‘u/trametric”’ inequality 


A(x, z) & sup (d(x, y), d(y, z)). 
From this one sees that a sequence u, has a limit if and only if 
lim . (Uj+1—Un) = 9; 


similarly, a series converges if and only if its general term tends to 0. 


§2. p-adic equations 


2.1. Solutions 


Lemma.—Let...— D, > D,_, >... — D, be a projective system, and 
let D= lim . D, be its projective limit. If the D, are finite and nonempty, 


then D is nonempty. 

The fact that D + @ is clear if the D, > D,_, are surjective; we are 
going to reduce the lemma to this special case. For this, denote by D,_, the 
image of D,,, in D,; for fixed n, the D, , form a decreasing family of finite 
nonempty subsets; hence this family is stationary, i.e. D,_, is independent 
of p for p large enough. Let E, be this limit value of the D, ,. One checks 
immediately that D, > D,-_, carries E, onto E,_,; since the E, are non- 
empty, we have lim . E, + @ by the remark made at the beginning; hence. 


a fortiori lim .D, + 2. 


Notation.—lf fe Z[X,,..., Xn] t8 a polynomial with coefficients in 
Z,, and if n is an integer 2 1, we denote by f, the polynomial with coefficients 
in A, deduced from f by reduction (mod p"). 


Proposition 5.—Let f « Z[X,,..., Xn] be polynomials with p-adic 
integer coefficients. The following are equivalent: 

i) The f have a common zero in (Z,)”. 

il) For all n > 1, the polynomials f° have a common zero in (A,)". 

Let D (resp. D,) be the set of common zeros of the f“” (resp. f.“”). The 
D,, are finite and we have D = lim . D,. By the above lemma, D is nonempty 


if and only if the D, are nonempty; hence the proposition. 


A point x = (x,,..., X,) of (Z,)" is called primitive if one of the x; is 
invertible, that is, if the x; are not all divisible by p. One defines in a similar 
way the primitive elements of (A,)”. 


14 p-adic fields 
Proposition 6.—Let f“ ¢ Z,[X,,..., Xm] be homogeneous polynomials 
with p-adic integer coefficients. The following are equivalent: 


a) The f have a non trivial common zero in (Q,)”. 
b) The f‘ have a common primitive zero in (Z,)”. 
c) For all n > 1, the f. have a common primitive zero in (A,)". 
The implication b) => a) is trivial. Conversely, if x = (x,,...,X,) IS a 
nontrivial common zero of the f, put 
h = inf(v,(x1),.-. 5 Up(%m)) and y = p "x. 


It is clear that y is a primitive element of (Z,)", and that it is a common 
zero of the f. Hence b) <> a). 
- The equivalence of b) and c) follows from the above lemma. 


2.2. Amelioration of approximate solutions. 


We are concerned with passing from a solution (mod p”) to a true 
solution (i.e. with coefficients in Z,). One uses the following lemma (p-adic 
analogue of ‘“‘Newton’s method’’): 


Lemma.—Let f< Z, [X] and let f’ be its derivative. Let xe Z,, n,k eZ 
such that 0 S$ 2k < n, f(x) = 0 (mod p"), v,(f'(x)) = k. Then there exists 
y € Z, such that 

f(y) = 0(mod p"*"), v,(f'(y)) = k, and y = x (mod p"~*). 
Take y of the form x+p" “z with z « Z,. By Taylor’s formula we have 
Sy”) = fx) +p" “2f'(x) +p”"" **a_ with ae Z,. 


By hypothesis f(x) = p"b and f'(x) = p*c with b « Z, and c € U. This allows 
us to choose z in such a way that 


b+zc = 0(mod p). 
From this we get 
f(y) = p(b+20c)+p*"" *“a = 0 (mod p"**) 


since 2n—2k > n. Finally Taylor's formula applied to /’ shows that 
f'(y) = p*c (mod p"~"); since n—k > k, we see that v,(f’(y)) = k. 


Theorem 1.—Let fe Z,[X,,..., Xml, X = (x;) €(Z,)", n, k ¢ Z and j an 
integer such that 0 < j < m. Suppose that 0 < 2k <n and that 


f(x) = 0(mod p") and v, (4 (9 =k, 


Then there exists a zero y of f in (Z,)" which is congruent to x modulo p”*. 


Suppose first that m = 1. By applying the above lemma to x = x, 
we obtain x") « Z, congruent to x‘ (mod p"~“) and such that 


f(x) = 0 (mod p"*") and v,(f'(x™)) = k. 
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We can apply the lemma to x"), after replacing n by n+1. Arguing induc- 
tively, we construct in this way a sequence x‘, ..., x... such that 


xt) = yl) (mod prta-ky. Se) = 0 (mod p"*). 


This is a Cauchy sequence. If y is its limit, we have f(y) = 0 and y = x 
(mod p"~*), hence the theorem for m = 1. 

The case m > | reduces to the case m = | by modifying only x;. More 
precisely, let f eZ,[X,] be the polynomial in one variable obtained by 
replacing X;, i + j, by x;. What has just been proven can be applied to f and 
x;; this shows the existence of y; = x; (mod p"~“) such that f(y,) = 0. If 
one puts y; = x; fori + j, the element y = ()y;) satisfies the desired condition. 


Corollary 1.—Every simple zero of the reduction modulo p of a polynomial 
f lifts to a zero of f with coefficients in Z,. 

(If g is a polynomial over a field k, a zero x of g is called simple if at 
least one of the partial derivatives ¢g/@X, is nonzero at x.) 

This 1s the special case n = 1, k = 0. 


Corollary 2.—Suppose p + 2. Let f(X) = 2a,,;X;X; with a,; = a;; be a 
quadratic form with coefficients in Z, whose discriminant det(a;,;) is invertible. 
Let ae Z,. Every primitive solution of the equation f(x) = a (mod p) lifts to a 
true solution. 

In view of cor. 1, it suffices to show that x does not annihilate all the 


; oo, e . ; 
partial derivatives of f modulo p. Now f = 22 ,a;;X;; since det(a;;) + 0 
(mod p) and x is primitive, one of these partial derivatives is #0 (mod p). 
Corollary 3.—Suppose p = 2. Let f = Xa,;X;X; with a;; = a;, be a 
quadratic form with coefficients in Z, and let ae Z,. Let x be a primitive 
solution of f(x) = a (mod 8). We can lift x to a true solution provided x does 


not annthilate all the SF modulo 4; this last condition is fulfilled if det(a; ;) is 
invertible. OX; 

The first assertion follows from the theorem applied to n = 3, k = 1; the 
second can be proved as in the case p + 2 (taking into account the factor 2). 


§3. The multiplicative group of Q, 
3.1. The filtration of the group of units 


Let U = Z} be the group of p-adic units. For every n 2 1, put U, = 
1+p"Z,; this is the kernel of the homomorphism e«,: U — (Z/p"Z)*. In 
particular, the quotient U/U, can be identified with F¥, hence is cyclic of 
order p—1 (cf. Chap. I, th. 2). The U, form a decreasing sequence of open 
subgroups of U, and U = lim. U/U,. If n 2 1, the map 


(1+p"x) (x modulo p) 
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defines an isomorphism U,/U, 4, -> Z/pZ; this follows from the formula: 


(l+p"x) (1 +p"y) = 14+p"x+y) (mod p"*"). 


We see from this, by induction on n, that U,/U, has order p"*. 


Lemma.—Let 0 -» A ~ E -> B -> 0 be an exact sequence of commutative 
groups (denoted additively) with A and B finite with orders a and b prime to 
each other. Let B’ be the set of x ¢ E such that bx = 0. The group E is the 
direct sum of A and B'. Moreover B' is the only subgroup of E isomorphic to B. 

Since a and bare relatively prime, there exist r, s ¢ Z such that ar+bs = 1. 
IfxeAo B’, then ax = bx = 0, hence (ar+ bs) x = x =O; and ANB = 
0. Moreover, all x « E can be written x = arx+bsx; since bB’ = 0, we have 
bE < A, hence bsx ¢ A; on the other hand, from abE = 0 follows that 
arx © B’. Hence we see that E = A @ B’ and the projection E — B defines 
an isomorphism of B’ onto B. Conversely, if B” is a subgroup of E iso- 
morphic to B, we have 6B” = 0 hence B” © B’ and B" = B’ because these 
groups have the same order. 


Proposition 7.--One has U = Vx U, where V = {x € U|x?~! = 1} is the 
unique subgroup of U isomorphic to FY. 
One applies the lemma to the exact sequences 


| > U,/U, ~ U/U, — FR > 1, 


which is allowable because the order of U,/U, is p"~' and the order of F% 
is p—1!. From this, one concludes that U/U, contains a unique subgroup V,, 
isomorphic to F¥ and the projection 


U/U, — U/U,- 1 
carries V, isomorphically onto V,_,. Since U = lim. U/U,, we get from 
this, by passage to the limit, a subgroup V of U isomorphic to F¥. One has 
U = VxU,; the uniqueness of V follows from that of the V,,. 
Corollary.—The field Q, contains the (p—1)th roots of unity. 


Remarks—1) The group V is called the group of multiplicative repre- 
sentatives of the elements of FF. 

2) The existence of V can also be proved by applying cor. | of th. | to 
the equation x77 '—1 = 0. 


3.2. Structure of the group U, 


Lemma.—Let x ¢ U,—U,4, with n2 1 if pp +2 and n2 2 if p =2. 
Then x? € U,4,—-U, 42. 
By hypothesis, one has x = 1+kp" with k = 0 (mod p). The binomial 
formula gives 
xP = L4+kp"*'+... +k?p"? 


and the exponents in the terms not written are 22n+1, hence also 2n+2. 
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Moreover np 2 n+2 (due to the fact that n 2 2 if p = 2). This shows that 


xP = 1+kp"*! (mod p"**) 
hence x? € U,,, —U, +>. 


Proposition 8.—/f p + 2, U, is isomorphic to Z,. 
If p = 2, U, = {+1} U, and U, is isomorphic to Z,. 

Consider first the case p + 2. Choose an element a¢U,-—U,, for 
example « = |+p. By the above lemma, we have «? ¢ U;,,—U;,>. Let 
a, be the image of « in U,/U,; we have («,)?" ° + 1 and (a,)?" ‘= 1. But 
U,/U, is of order p"~'; hence it is a cyclic group, generated by «,. Now, 
denote by 6, , the isomorphism z +> «7 of Z/p""'Z onto U,/U,. The diagram 


Ont+1t.0 


Z/p'~ ————> U,/U, + 1 


| | 


Z/p"'Z "> U,/U, 
is commutative. From this one sees that the 6, , define an isomorphism 
0 of Z, = lim. Z/p""'Z onto U, = lim. U,/U,, hence the proposition for 
+2 <— <— 
p 
Suppose now that p = 2. Choose a ¢ U, —Us;, that is « = 5 (mod 8). 
Define as above isomorphisms 


6,4: Z/2""-*Z > U2/U,, 
hence an isomorphism 6,: Z, —» U,. On the other hand, the homomorphism 
U, — U,/U, ~ Z/2Z 
induces an isomorphism of { +1} onto Z/2Z. From this we get 
U, ={+1}xU,, qed. 


Theorem 2.—The group Q} is isomorphic to Zx Z, x Z[(p—1)Z if p + 2 
and to Zx Z, x Z/2Z if p = 2. 

Every element x ¢ QF can be written uniquely in the form x = p"u with 
neéeZ and ue U. Hence QF ~ ZxU. Moreover, prop. 7 proves that U = 
VxU, where V is cyclic of order p—1, and the structure of U, is given by 
prop. 8. 


3.3. Squares in QF 


Theorem 3.—Suppose p + 2 and let x = p"u be an element of Q*, with 
neZanduev. For x to be a square it is necessary and sufficient that n is 
even and the image u of u in F¥ = U/U, is a square. 


(The last condition means that the Legendre symbol (*) of uw Is equal 
- p 
to 1. We write in the following (*) instead of (7) ) 
Pp P 
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Decompose u in the form u = v.u, with v € V and u, € U,. The decom- 
position Q¥ ~ ZxVxU, of th. 2 proves that x is a square if and only if 
n is even and v and u, are squares; but U, is isomorphic to Z, and 2 is 
invertible in Z,; all the elements of U, are then squares. Since V is tso- 
morphic to F¥, the theorem follows. 


Corollary.—/f p + 2, the group Q*/Q*? is a group of type (2, 2). It has 
for representatives {1, p, u, up} where uc U is such that (*) = —|], 
p 


This is clear. 


Theorem 4.—For an element x = p"u of QF to be a square it is necessary 
and sufficient that n is even and u = 1 (mod 8). 

The decomposition U = { +1}xU, shows that u ts a square if and only 
if u belongs to U, and is a square in U,. Now the isomorphism @: Z, — U, 
constructed in the proof of prop. 8 carries 2"Z, onto U,, 2. Taking # = 1, 
we see that the set of squares of U, is equal to U,. An element uc U ts 
then a square if and only if it is congruent to 1 modulo 8, hence the theorem. 


Remark.—The fact that every element of U, 1s a square follows also 
from cor. 3 of th. | applied to the quadratic form X?”. 


Corollary.—The group Q¥/Q*? is of type (2, 2, 2). It has for representatives 
{+1, +5, +2, +10}. 

This follows from the fact that { +1, +5} 1s a system of representatives 
for U/U3. 


Remarks. 
1) For p = 2, define homomorphisms ¢e, w: U/U, — Z/2Z by means of 
the formulas of chap. I, n° 3.2: 


oe) = 7~! (mod2 = 19 fF 2 = M(mod 4) 
2 1 if z= —I (mod 4) 
2 _ 

w(2) = 71 (mod 2) = 19 if 2 = +1 (mod 8) 
6 | if z= +5 (mod 8). 


The map « defines an isomorphism of U/U, onto Z/2Z and the map w an 
isomorphism of U,/U, onto Z/2Z. The pair (ec, w) defines thus an isomorphism 
of U/U, onto Z/2Z x Z/2Z; in particular a 2-adic unit z is a square if and 
only if e(z) = w(z) = 0. 

2) Theorems 3 and 4 show that Q*? is an open subgroup of Q*. 


Chapter [II 
Hilbert Symbol 


§1. Local properties 


In this paragraph, k denotes either the field R of real numbers or the 
field Q, of p-adic numbers (p being a prime number). 


1.1. Definition and first properties 


Let a, be k*. We put: 
(a, b) = Lif z?—ax?—by* = Ohasa solution (z, x, y) + (0,0, 0) in k?. 
(a, b) = —1 otherwise. 
The number (a, b) = +1 is called the Hilbert symbol of a and b relative to k. 
It is clear that (a, b) does not change when a and b are multiplied by squares; 
thus the Hilbert symbol defines a map from k*/k*? x k*/k*? into {+1}. 


Proposition 1.—Let a, b «k* and let k, = k(,/b). For (a, 6) = 1 it is 
necessary and sufficient that a belongs to the group Nk* of norms of elements 
of k¥. 

If b is the square of an element c, the equation z?—ax?—by” = 0 has 
(c, 0, 1) for a solution, hence (a, b) = 1, and the proposition is clear in this 
case since k, = k and Nk; = k*. Otherwise, k, is quadratic over k; if B 
denotes a square root of b, every element £ ¢ k, can be written z+ fy with 
y, z€k and the norm N(€) of é is equal to z?— by’. If a ¢ Nkf, there exist 
y, z€k such that a = z*—by’, so that the quadratic form z?—ax?— by’ 
has a zero (z, I, y) and we have (a, 6) = 1. 

Conversely, if (a2, b) = 1, this form has a zero (z, x, y) + (0, 0, 0). One 
has x + 0, for otherwise 6 would be a square. From this we see that a is the 


norm of = + B2. 
x x 


Proposition 2.—The Hilbert symbol satisfies the formulas: 


i) (a, b) = (b, a) and (a,c*) = 1, 
li) (a, —a) = 1 and (a, 1—a) = 1, 
ili) (a, 6) = 1 = (aa’, b) = (a’, db), 
iv) (a, b) = (a, —ab) = (a, (1 —a)b). 


(In these formulas a, a’, b, c denote elements of k*; one supposes a + | 
when the formula contains the term | —a.) 

Formula i) is obvious. If 6 = —a (resp. if 6 = 1—a) the quadratic 
form z*—ax*—by* has for zero (0, 1, 1) (resp. (1, 1, 1)); thus (a, 5) = 1, 
which proves ii). If (a, b) = 1, a is contained in the subgroup Nk}, cf. prop. 1; 
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we then have a’ « Nk¥ <> aa’ « Nk#, which proves ili). Formula iv) 
follows from 1), 11), 111). 


Remark.—Formula iti) is a particular case of 
v) (aa', b) = (a, b) (a’, bd), 
which expresses the bilinearity of the Hilbert symbol; this formula will be 
proved in the following section. 


1.2. Computation of (a, b) 


Theorem 1.—//k = R, we have (a, 6) = | ifaorbis >0, and(a,b) = —1 
if a and b are <0. 

Ifk = Q, and if we write a, b in the form p*u, pv where u and v belong to 
the group U of p-adic units, we have 


B a 
(a,b) = (- preso(2) (5) ifp +2 


(a, b) = (- ] eCudeCv) + 2ea(v) + Boru) if p — 7. 


[Recall that “'\ denotes the Legendre symbol “\ where a is the image of u 
P Pp 
by the homomorphism of reduction modulo p: U > F¥. As for e(u) and 


—_— 2 
w(u), they denote respectively the class modulo 2 of and of — . 


cf. Chap. IT, n° 3.3.] 


Theorem 2.—The Hilbert symbol is a nondegenerate bilinear form on the 
F,-vector space k*/k*?. 

[The bilinearity of (a, b) is just formula v) mentioned at the end of n° 1.1. 
The assertion ‘“‘(a, 6) is nondegenerate’”’ means that, if 6 ek* is such that 
(a, b) = 1 for all a © k*, one has b € k*?.] 


Corollary.—/f b is not a square, the group Nk defined in prop. \ is a 
subgroup of index 2 in k*. 

The homomorphism ¢,: k* >{+1} defined by ¢,(a) = (a, 6) has 
kernel Nks by prop. 1; moreover, ¢, is surjective since (a, b) is nondegener- 
ate. Hence ¢, defines an isomorphism of k*/Nk¥ onto {+1}; the corollary 
follows from this. 


Remark.—More generally, let L be a finite extension of &k which 1s 
galoisian and whose Galois group G 1s commutative. One can prove that 
k*/NL* is isomorphic to G and that the knowledge of the group NL* 
determines L. These are two of the main results of the so-called “‘local class 
field theory.”’ 


Proof of theorems | and 2. 
The case k = R is trivial. Note that k*/k*? is then a vector space of 
dimension | (over the field F,) having {1, — 1} for representatives. 


Local properties 21 
Suppose now that k = Q,. 


Lemma.—Let ve U be a p-adic unit. If the equation z* —px*—vy* = 0 
has a nontrivial solution in Q,, it has a solution (z, x, y) such that z, ye U 
and x € Z,. 

By prop. 6 of chap. II, n° 2.1, the given equation has a primitive solution 
(z, x, y). Let us show that this solution has the desired property. If it did not, 
we would have either y = 0 (mod p) or z = 0 (mod p); since z*—vy? = 0 
(mod p) and v = 0 (mod p), we would have both y = 0 (mod p) and z = 0 
(mod p), hence px? = 0 (mod p?), i.e. x = 0 (mod p) contrary to the primitive 
character of (z, x, y). 


We now return to the proof of theorem |, and we suppose first that p + 2. 
It is clear that the exponents a and 8 come in only by their residue 
modulo 2; in view of the symmetry of the Hilbert symbol, there are only 
three cases to consider: 
1) « = 0, B = 0. We must check that (u, v) = 1. Now the equation 


z*—ux*—vy? = 0 
has a nontrivial solution modulo p (chap. I, §2, cor. 2 to th. 3); since the 


discriminant of this quadratic form is a p-adic unit, the above solution 
lifts to a p-adic solution (chap. II, n° 2.2, cor. 2 to th. 1); hence (u, v) = I. 


2) « = 1, 8 = 0. We must check that (pu, v) = ”\. Since (u, v) = 1 we 
p 
have (pu, v) = (p, v) by formula iii) of prop. 2; thus it suffices to check that 
(p, v) = (?). This is clear if v is a square, the two terms being equal to 1. 
p 
Otherwise 6 = —1, see chap. II, n° 3.3, th. 3. Then the above lemma 


p 


shows that z* — px? — vy” does not have a nontrivial zero and so (p, v) = —1. 


3) a = 1,8 = 1. We must check that (pu, pv) = (—1)?7 1)? (“\ 2), 
P/\P 
Formula iv) of prop. 2 shows that: 
(pu, pv) = (pu, —p*uv) = (pu, —uv). 


By what we have just seen, (pu, pv) = | , from which the desired result 


follows since ( —! = (—1)07 Y/2, 
Pp 


Once theorem | is established (for p + 2), theorem 2 follows from it, 
since the formula giving (a, 5) is bilinear; in order to prove the nondegeneracy, 
it suffices to exhibit, for all a ¢ k*/k*? distinct from the neutral element, an 


element 5 such that (a, b) = —1. By cor. to th. 3 of chap. IT, n° 3.3, we can 
take a = p, u or up with we U such that ( M\ a + 1; then we choose for b 
p 


respectively, u, p, and wu. 
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The case p = 2. Here again, « and 8 come in only by their residue modulo 
2 and there are three cases to consider: 

1) « = 0, B = 0. We must check that (u, v) = | if u or v ts congruent to 
| (mod 4) and (u, v) = —1 otherwise. Suppose first that wu = | (mod 4). 
Then wu = | (mod 8) oru = 5 (mod 8). In the first case u is a square (chap. II, 
n° 3.3, th. 4) and we do have (u, v) = 1. In the second case we have u+4v = 1 
(mod 8) and there exists w ¢ U such that w? = u+4u; the form z? —ux? — vy? 
has thus (w, 1, 2) for a zero and we do have (u, v) = |. Let us now suppose 
u =v = —1 (mod 4); if (z, x, y) isa primitive solution of z?7—ux* —vy” = 0, 
then z7+.x?+y? = 0 (mod 4); but the squares of Z/4Z are 0 and 1; this 
congruence implies that x, y, z are congruent to 0 (mod 2), which contradicts 
the hypothesis of primitivity. Thus we have (u, v) = —1 in this case. 

2) « = 1, B = 0. We must check that (2u, v) = (— 1)" T°, First let 
us show that (2, v) = (—1)°”, ie. that (2, v) = 1 is equivalent tov = +1 
(mod 8). By the above lemma if (2, v) = 1, there exists x, y, z¢ Z, such 
that z7>—2x?—vy? = 0 and y, z = 0 (mod 2). Then we have y* = z? = | 
(mod 8), hence |1—2x?—v = 0 (mod 8). But the only squares modulo 8 are 
0, 1, and 4; from this we get v = +1 (mod 8). Conversely, if v = 1 (mod 8), 
v is a square and (2, v) = 1; if v = —1 (mod 8), the equation z?—2x? — vy’ 
= (0 has (1, I, 1) for a solution modulo 8, and this approximate solution 
lifts to a true solution (chap. II, n° 2.2, cor. 3 to th. 1); thus we have (2, v) 
= |, 

We show next that (2u, v) = (2, v) (u, v); by prop. 2, this is true if 
(2, v) = I or (u, v) = 1. The remaining case is (2, v) = (u, v) = —1, Le. 
v = 3 (mod 8) and u = 3 or —1 (mod 8); after multiplying u and v by 
squares, we can suppose that u = —1, v = 3 or u = 3, v = —5; now the 
equations 


27+2x?—3y? = Oand z7—6x?+5y? = 0 


have for solution (1, 1, 1); thus we have (2u, v) = 1. 
3) « = |, B = 1. We must check that 


(Qu, 2v) = (— 1) melo) + ou) + oC), 
Now formula tv) of prop. 2 shows that 

(lu, 2v) = (2u, —4uv) = (2u, — ut). 
By what we have just seen, we have 


(2u 2v) = (- J PCwret— ae) Fo(~ ue) 
, ° 


Since e(—1) = I, w(—1) = 0 and e(u) (1 +e(u)) = 0, the above exponent is 
equal to e(u)e(v)+w(u)+(v), which proves theorem |. The bilinearity of 
(a, b) follows from the formula giving this symbol, since « and w are homo- 
morphisms. The nondegeneracy is checked on the representatives {u, 2u} 
with uw = 1, 5, —1 or —5S. Indeed, we have (5, 2u) = —1 and 


(-1, -1) =(—I, -5) = -1. 
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Remark.—Write (a, 6) in the form (—1)'*") with [a, 6] « Z/2Z. Then 
(a, b] is a symmetric bilinear form on k*/k** with values in Z/2Z and th. 1 
gives its matrix with respect to some basis of k*/k*?: 

— For k = R, it is the matrix (1). 


— Fork = Q,, p + 2, with basis {p, u} where (*) = —I, it is the matrix 
Pp 


O01). _ Pl). 
(, 0) if p = 1 (mod 4) and (; 0} if p = 3 (mod 4). 


001 
— For k = Q,, with basis {2, —1, 5} it is the matrix ( | o) 
100 


é 


§&2. Global properties 


The field Q of rational numbers embeds as a subfield into each of the 
fields Q, and R. If a, be Q*, (a, 5), (resp. (a, ),,) denotes the Hilbert 
symbol of their images in Q, (resp. R). We define V to be the set of prime 
numbers together with the symbol oo, and make the convention that Q,, = 
R, hence Q is dense in Q, for all ve V. 


2.1. Product formula 


Theorem 3 (Hilbert).—/f a, b « Q*, we have (a, b), = 1 for almost all 
ve Vand 
[| (, 4), = 1. 
veVv 


(The expression “‘almost all » « V’? means “‘all the elements of V except a 
finite number’’.) 

Since the Hilbert symbols are bilinear, it suffices to prove the theorem 
when a or b are equal to —1 or to a prime number. In each case, theorem | 
gives the value of (a, b),. We find 

1)a= —1, b= —1. One has (—1, -1), =(-1, -—1),2 = —1 and 
(—1,-1), = 1 if p + 2, «©; the product is equal to 1. 

2) a= —1, b =/ with / prime. If / = 2, one has (—1, 2), = 1 for all 
vue V; if / + 2, one has (-1, /), = 1 if v + 2, /and (-1, J), =(-1,/), = 
(— 1). The product is equal to 1. 

3)a=/,6=T' with/, /’ primes. If / = /', formula iv) of prop. 2 shows 
that (/, J), = (1, J), for all v €¢ V and we are reduced to the case considered 
above. If / # /' and if /’ = 2, one has (/, 2), = 1 for v + 2, / and 


(I, 2), =(—1)°, (4, 2), = @, = (—1)°, cf. chap. I, n° 3.2, th. 5. 


If /and /’ are distinct and different from 2, one has (/, 7’), = 1 for v 4 2, /, ’and 


(1), = (-1YO"”, (LT), = (5) (4, ly = (=) 
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but by the quadratic reciprocity law (chap. I, n° 3.3, th. 6) one has 


l'\ fl ; 
~\f—) = (—1)oe, 
U(r)“ 


hence the product is equal to 1. This completes the proof. 


Remark.—The product formula is essentially equivalent to the quadratic 
reciprocity law. Its interest comes mainly from the fact that it extends to 
all algebraic number fields (the set V being replaced by the set of “‘places”’ 
of the field). 


2.2. Existence of rational numbers with given Hilbert symbols 


Theorem 4.—Let (a;);-, be a finite family of elements in Q* and let 
(€;, ier,vev De a family of numbers equal to +1. In order that there exists 
x € Q* such that (a;, x), = ¢;., for allie I and all ve V, it is necessary and 
sufficient that the following conditions be satisfied: 

(1) Almost all the «; , are equal to 1. 
(2) For allie I we have IT é,, = 1. 


(3) For all v € V there exists x, € Q* such that (a;, x,), = &,,for allie I. 
The necessity of (1) and (2) follows from theorem 3; that of (3) is trivial 
(take x, = x). 
To prove the sufficiency of these conditions, we need the following three 
lemmas: 


Lemma 1 (‘Chinese remainder theorem”’).—Let a,,...,Q,, ™,,...,M, 
be integers with the m, being pairwise relatively prime. There exists an integer a 
such that a = a; (mod m,) for all i. 

Let m be the product of m;. Bezout theorem shows that the canonical 
homomorphism 


Z|mZ > [] Z/m,Z 
i=1 
is an isomorphism. The lemma follows from this. 


Lemma 2 (“Approximation theorem’’).—Let S be a finite subset of V. 
The image of Q in IT Q,, is dense in this product (for the product topology 
vE 


of those of Q,). 

Being free to enlarge S, we can suppose that S = { 00, p,,...p,} where 
the p; are distinct prime numbers and we must prove that Q is dense in 
RxQ,,x ... XQ,,. Let (xo, %1,-.-++%,) be a point of this product and 
let us show that this point is adherent to Q. After multiplying by some 
integer, we may suppose that x; « Z,, for 1 S$ i S$ n. Now one has to prove 
that, for all e > 0 and all integers N > 0, there exists x « Q such that 


lxn—x,] S ¢€ and v,(x-x) 2N fori=1,...,n. 
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By lemma | applied to m; = p)’, there exists x9 ¢ Z such that v, (x9 — x;) 
> N for all i. Choose now an integer gq 2 2 which is prime to all the p; (for 
example a prime number). The rational numbers of the form a/q”, a « Z, 
m = 0, are dense in R (this follows simply from the fact g”™ — oo when 
m --> 00). Choose such a number u = a/q™ with 


Xo —X.,, tup)...p™| Se. 
The rational number x = xo+up;’...p) has the desired property. 


Lemma 3 (‘‘Dirichlet theorem’’).—/f a and m are relatively prime integers 
21, there exist infinitely many primes p such that p = a (mod m). 

The proof will be given in chap. VI; the reader can check that it uses 
none of the results of chapters III, 1V, and V. 


Now come back to theorem 4, and let (e; ,) be a family of numbers 
equal to +1 and satisfying conditions (1), (2), and (3). After multiplying 
the a; by the square of some integer, we can suppose that all the a; are 
integers. Let S be the subset of V made of o0, 2, and the prime factors of a;; 
let T be the set of ve V such that there exists i¢ / with e; , = —1; these 
two sets are finite. We distinguish two cases: 

1) We have SOT = @. 


Put 
a=|[]/ and m=8][]|1. 
leT leS 
[+00 !+:2,00 


Because SOT = o, the integers a and m are relatively prime and, by 
lemma 3, there exists a prime number p = a (mod m) with pé¢ SUT. We 
are going to show that x = ap has the desired property, t.e. (@;, x), = &., 
for allie Jandve V. 

If ve S, we have e; , = 1 since SO T= g, and we must check that 
(a;, x), = 1. If v = oo, this follows from x > 0; if v is a prime number /, 
we have x = a” (mod m), hence x = a? (mod 8) for / = 2 and x = a? (mod /) 
for / + 2; since x and a are /-adic units, this shows that x is a square in QF 
(cf. chap. HI, n° 3.3) and we have (a;, x), = I. 

If v = / 1s not in S, a; is an /-adic unit. Since / + 2 we have 


a. \ 0) 
(a;, 5), -(7) for all be Q), cf. th. 1. 


If /¢ TU{p}, x is an /-adic unit, hence v,(x) = 0 and the above formula 
Shows that (a,, x), = 1; on the other hand, we have e; , = 1 because / ¢ T. 
If /<¢ T, we have v,(x) = 1; moreover, condition (3) shows that there exists 
x, € Q? such that (a;, x;), = ¢;,, for all i¢/; since one of the e; , is equal 
to —I (because / belongs to 7), we have v,(x,) = | (mod 2) hence 


(4;, x), = (“') = (4;,X,), = &,,  forallie/. 
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There remains the case / = p, which we deduce from the others using 

the product formula: 
(a;, X)p = I] (a;, X), = I] fi, = fi, p- 
vAp vp 

This completes the proof of theorem 4 in the case SN T = @. 

2) General case. 
We know that the squares of Q* form an open subgroup of Q*, cf. chap. II, 
n° 3.3. By lemma 2, there exists x’ ¢ Q* such that x’/x, is a square in Q* for 
all ve S. In particular (a;, x’), = (a@;, x,), = €,, for all v eS. If we set 
Ni,» = €,,(a;, X'),, the family (y;.,) verifies conditions (1), (2), (3) and 
moreover 7; , = 1 if v eS. By 1) above there exists y¢Q* such that 
(a;, Y)y = 7;,, for allie 7 and all ve V. If we set x = yx’, it is clear that x 
has the desired properties. 


Chapter IV 


Quadratic Forms over Q, and over Q 


§1. Quadratic forms 
1.1. Definitions 


First recall the general notion of a quadratic form (see BOURBAKI, Alg., 
chap. IX, 3, n° 4). 


Definition 1.—Let V be a module over a commutative ring A. A function 
Q: V — A is called a quadratic form on V if: 


1) O(ax) = a*Q(x) forae A and xeV 
2) The function (x, y)++ O(x+ y)— O(x)— Q()) is a bilinear form. 


Such a pair (V, Q) is called a quadratic module. In this chapter, we limit 
ourselves to the case where the ring A is a field k of characteristic #2; the 
A-module V is then a k-vector space; we suppose that its dimension is finite. 

We put: 

xy = ${QO(x+y)— Ox) - Q(y)}; 


this makes sense since the characteristic of k is different from 2. The map 
(x, y)t> x.y 1S a symmetric bilinear form on V, called the scalar product 
associated with Q. One has Q(x) = x.x. This establishes a bijective corre- 
spondence between quadratic forms and symmetric bilinear forms (it would 
not be so in characteristic 2). 

If (V, Q) and (V’, Q’) are two quadratic modules, a linear map f: V > V’ 
such that Q’o f= Q 1s called a morphism (or metric morphism) of (V, Q) 
into (V’, Q’); then f(x). f(y) = x.y for all x, ye V. 


Matrix of a quadratic form.—Let (e;),<i<, be a basis of V. The matrix 
of Q with respect to this basis is the matrix A = (a,;) where a;; = ee 
it is symmetric. If x = Xx,e; is an element of V, then 


Q(x) = Di iXiX;, 


tJ 


yp 


which shows that Q(x) is a “quadratic form” in x,,..., x, in the usual 
sense. 

If we change the basis (e;) by means of an invertible matrix X, the matrix 
A’ of Q with respect to the new basis is X.A.'X where 'X denotes the trans- 
pose of X. In particular 


det(A’) = det(A). det(X)’, 


which shows that det(A) is determined up to multiplication by an element of 
k*?: it is called the discriminant of Q and denoted by disc(Q). 


27 


28 Quadratic forms over Q, and over Q 


1.2. Orthogonality 


Let (V, Q) be a quadratic module over k. Two elements x, y of V are 
called orthogonal if x.y = 0. The set of elements orthogonal to a subset H 
of V is denoted by H®; it is a vector subspace of V. If V, and V, are two 
vector subspaces of V, they are said to be orthogonal if V, < V3, i.e. if xe V;. 
y € V, implies x.y = 0. 

The orthogonal complement V° of V itself is called the radical (or the 
kernel) of V and denoted by rad(V). Its codimension is called the rank of Q. 
If V° = 0 we say that Q is nondegenerate; this is equivalent to saying that 
the discriminant of Q is + 0 (in which case we view it as an element of the 
group k*/k*?). 

Let U be a vector subspace of V, and let U* be the dual of U. Let 
qu: V — U* be the function which associates to each x € V the linear form 
(ye Ut x.y). The kernel of gy is U°. In particular we see that Q is non- 
degenerate if and only if gy: V — V* is an isomorphism. 


Definition 2.—Let U,,..., U,, be vector subspaces of V. One says that 
V is the orthogonal direct sum of the U; if they are pairwise orthogonal and 
if V is the direct sum of them. One writes then: 


V = U,®...@U,. 
Remark.—lf x « V has for components x; in U,, 


O(x) = Oily) +... + Qn(%m)s 


where Q; = Q|U; denotes the restriction of Q to U;. Conversely if (U;, Q;) 
is a family of quadratic modules, the formula above endows V = @U;, 
with a quadratic form Q, called the direct sum of the Q;, and one has 


V= U.®... QU, 
Proposition 1.—/f U is a supplementary subspace of rad(V) in V, then 
V=U@®@ rad(V). 
This is clear. 
Proposition 2.—Suppose (V, Q) is nondegenerate. Then: 
1) Allmetric morphisms of V into a quadratic module (V', Q') are injective. 
li) For all vector subspaces U of V, we have 
y°° = U, dim U+dim U® = dim V, rad(U) = rad(U°) = UN U®. 


The quadratic module U is nondegenerate if and only if U° is nondegenerate, in 
which case V = U® U®. 

11) Lf V is the orthogonal direct sum of two subspaces, they are nondegenerate 
and each of them is orthogonal to the other. 


If f; V+ V’ is a metric morphism, and if f(x) = 0, we have 
xy = f(x).f(y) = 0 for all ye V; 
this implies x = 0 because (V, Q) 1s nondegenerate. 
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If U is a vector subspace of V, the homomorphism qy: V — U* defined 
above is surjective; indeed, it is obtained by composing gy: V — V* with 
the canonical surjection V* — U* and we have supposed that q, is bijective. 
Thus we have an exact sequence: 


0+ U° +V-—->U* —0, 


hence dim V = dim U* + dim U® = dim U+dim U°. 

This shows that U and U®°® have the same dimension; since U is con- 
tained in U°°, we have U = U°°; the formula rad(U) = UN U® is clear; 
applying it to U°, and taking into account that U°® = U, we get rad(U°) = 
rad(U) and at the same time the last assertion of 11). Finally iii) is trivial. 


1.3. Isotropic vectors 


Definition 3.— An element x of a quadratic module (V, Q) is called isotropic 
if O(x) = 0. A subspace U of V is called isotropic if all its elements are iso- 
tropic. 

We have evidently: 


U isotropic <= Uc U° = QIU =0. 


Definition 4.—A quadratic module having a basis formed of two isotropic 
elements x,y such that x.y + 0 is called a hyperbolic plane. 
After multiplying y by I/x.y, we can suppose that x.y = 1. Then the 


matrix of the quadratic form with respect to x, y is simply () aE its 


discriminant is —1 (in particular, it is nondegenerate). 


Proposition 3.—Let x be an isotropic element +0 of a nondegenerate 
quadratic module (V, Q). Then there exists a subspace U of V which contains 
x and which is a hyperbolic plane. 

Since V is nondegenerate, there exists z<¢ V such that x.z = 1. The 
element y = 2z—(z.z)x is isotropic and x.y = 2. The subspace U = kx+ky 
has the desired property. 


Corollary.—/f (V, Q) is nondegenerate and contains a nonzero isotropic 
element, one has Q(V) = k. 

(In other words, for all aek, there exists ve V such that Q(v) = a.) 

In view of the proposition, it suffices to give the proof when V is a 
hyperbolic plane with basis x, y with x, y isotropic and x.y = 1. If ack, 


then a = o(=+57) and from this OQ(V) = k. 


1.4. Orthogonal basis 


Definition 5.— A basis (e,,...,e,) of a quadratic module (V, Q) is called 
orthogonal if its elements are pairwise orthogonal, i.e. if V = ke, 6...8 ké,. 
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This amounts to saying that the matrix of Q with respect to this basis is 
a diagonal matrix: 


a, 0...0 
0 a,...0 
0 0...a, 
If x = Dx,e,;, one has Q(x) = a,x?+... +a,x?. 


Theorem 1.—-Every quadratic module (V, Q) has an orthogonal basis. 

We use induction on n = dim V, the case n = O being trivial. If V is 
isotropic, all bases of V are orthogonal. Otherwise, choose an element 
e, « V such that e,.e, # 0. The orthogonal complement H of e, is a hyper- 
plane and since e, does not belong to H, one has V = ke, @ H; in view of 
the inductive hypothesis, H has an orthogonal basis (e,,...,¢e,), and 
(e,,...,e,) has the desired property. 


Definition 6.—7wo orthogonal bases 
e=(e,,...,e,) ande’ = (e),...,2e,) 


of V are called contiguous if they have an element in common (i.e. if there 
exist i and j such that e; = e;). 


Theorem 2.—Let (V, Q) be a nondegenerate quadratic module of dimension 
23,andlete = (e,,...¢,),e@ = (e;,... e,) be two orthogonal bases of V. There 
exists a finite sequence e, e@), ...,e™ of orthogonal bases of V such that 
e) = ee) — e' ande is contiguous with e“*! for0 S i < m. 

(One says that e,..., e” is a chain of orthogonal bases contiguously 
relating e to e’) 

We distinguish three cases: 

i) (€,.€1) (ej-e/) —(e,€/)? + 0 

This amounts to saying that e, and e, are not proportional and that the 

plane P = ke, +ke, is nondegenerate. There exist then e,, e; such that 


P = ke, @ ke, and P = ke; ® ke3. 


Let H be the orthogonal complement of P; since P is nondegenerate, we 
have V = H @ P, see prop. 2. Let (e3,..., e,) be an orthogonal basis of H. 
One can then relate e to e’ by means of the chain; 


” “ / , ” Mu ‘ 
e > (€), £2, €3,....€,) > (@;, 3, €3,...,e, Je, 


hence the theorem in this case. 
ii) (€1.€;) (€3-€3) —(€1-e3)? + 0 
Same proof replacing e; by e3. 
iii) (e,.e,) (e.e;) —(e,e;)? = 0 for i = 1, 2. 
We prove first: 


Lemma.—There exists x ¢k such that e, = e;+xe, is nonisotropic and 
generates with e, a nondegenerate plane. 
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We have e,.e, = e;.e€;+x7(e3.e2); we must thus take x? distinct from 
—(e;.e;)/(e3.e3). Moreover, for e, to generate with e, a nondegenerate plane, 
it is necessary and sufficient that 


(e, €;) (e,.e,) ~ (e, ey)” # 0. 


If we make this explicit, taking into account the hypothesis ili), we find 
that the left hand side is —2x(e,.e;) (e;.e,). Now hypothesis iii) implies 
e,.e; + 0 for i= 1, 2. We see thus that e, verifies the conditions of the 
lemma if and only if we have x 4 0 and x? # —(e}.e;)/(e3.e3). This elimin- 
ates at most three values of x; if k has at least 4 elements, we can find one 
such x. There remains the case k = F, (the case k = F, is excluded because 
char(k) # 2). But, then, all non-zero squares are equal to 1 and hypothesis 
iit) can be written (e,.e,) (e;-e;) = 1 for 1 = 1, 2; the expression (e;.e;)/ 
(e3.e;) is thus equal to 1, and, in order to realize the condition x? + 0, —1, 
it suffices to take x = I. 

This being so, let us choose e, = e; + xe, verifying the conditions of the 
lemma. Since e, is not isotropic, there exists e; such that (e,, e3) is an 
orthogonal basis of ke; @ kej. Let us put 


e” = (@,, €3, €3,..-5&,)3 
it is an orthogonal basis of V. Since ke,+ke, is a nondegenerate plane, 
part i) of the proof shows that one can relate e to e” by a chain of contiguous 
bases; since e’ and e” are contiguous, the theorem follows. 


1.5. Witt’s theorem 


Let (V, Q) and (V’, Q’) be two nondegenerate quadratic modules; let 
U be a subvector space of V, and let 


si:U->V' 


be an injective metric morphism of U into V’. We try to extend s to a sub- 
space larger than U and if possible to all of V. We begin with the case 
where U is degenerate: 


Lemma.—/f U is degenerate, we can extend s to an injective metric 
morphism s,: U,—» V' where U, contains U as a hyperplane. 

Let x be a non-zero element of rad(U), and / a linear form on U such 
that /(x) = 1. Since V is nondegenerate, there exists ye V such that /(u) = 
u.y for all ue U; we can moreover assume that y.y = 0 (replace y by y— Ax, 
with A = }y.y). The space U, = U @ ky contains U as a hyperplane. 

The same construction, applied to U’ = s(U), x’ = s(x) and I’ = /° 57! 
gives y’e V’ and U; = U’ @ ky’. One then checks that the linear map 
s,: U, - V’ which coincides with s on U and carries y onto y’ is a metric 
isomorphism of U; onto Uj. 


Theorem 3 (Witt).—Jf (V, Q) and (V’, Q’) are isomorphic and non- 
degenerate, every injective metric morphism 


s;U->V’ 
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of a subvector space U of V can be extended to a metric isomorphism of V 
onto V". 

Since V and V’ are isomorphic, we can suppose that V = V’. Moreover, 
by applying the above lemma, we are reduced to the case where U 1s non- 
degenerate. We argue then by induction on dim U. 

If dim U = 1, U ts generated by a nonisotropic element x; if y = s(x), 
we have y.y = x.x. One can choose « = +1 such that x+y is not isotropic; 
otherwise, we would have 


2x.x+2x.y = 2x.x-—2x.y = 0 


which would imply x.x = 0. Choose such an e, and let H be the orthogonal 
complement of z = x+ey; we have V = kz @ H. Let o be the “symmetry 
with respect to H’’, i.e. the automorphism of V which is the identity on H 
and which changes z to —z. Since x—e«y is contained in H, we have 


o(x—ey) = x—ey and o(xtey) = —x-ey, 


hence o(x) = —ey, and the automorphism —eo extends s. 


If dim U > 1, we decompose U in the form U, ® U,, with U,, U, + 0. 
By the inductive hypothesis, the restriction s, of s to U, extends to an 
automorphism o, of V; after replacing s by 0; ' o s, one can thus suppose 
that s is the identity on U,. Then the morphism s carries U, into the ortho- 
gonal complement V, of U,; by the inductive hypothesis, the restriction of 
s to U, extends to an automorphism o, of V,; the automorphism o of V 
which is the identity on U, and oc, on V, has the desired property. 


Corollary.—Two isomorphic subspaces of a nondegenerate quadratic 
module have isomorphic orthogonal complements. 

One extends an isomorphism between the two subspaces to an auto- 
morphism of the module and restricts it to the orthogonal complements. 


1.6. Translations 


Let f(X) = & aiiX/ +2 2 a;;X;X; be a quadratic form in n variables 

i= i<j 
over k; we put a;; = a,; if i > j so that the matrix A = (a;;) is symmetric. 
The pair (k”, f) is a quadratic module, associated to f (or to the matrix A). 


Definition 7.—Two quadratic forms f and f’ are called equivalent if the 
corresponding modules are isomorphic. 


Then we write f ~/f’. If A and A’ are the matrices of f and f’, this 
amounts to saying that there exists an invertible matrix X such that A’ = 
X.A.'X, see n° 1.1. 


Let f(X,,..., X,) and g(X,,..., X,,) be two quadratic forms; we will 
denote f+g (or simply f+g if no confusion is possible) the quadratic form 
I(x, ne. X)+Q(Xn 41 v9 Xn+m) 


in n+m variables. This operation corresponds to that of orthogonal sum 
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(see def. 2, n° 1.2). We write similarly f—g (or simply f—g) for f+(—g). 
Here are some examples of translations: 


Definition 4’—A form f(X,, X,) in two variables is called hyperbolic if 
we have 
f~ X,X3 ~m X7-X}. 


(This means that the module (k’, /') corresponding is a hyperbolic plane, 
cf. def. 4). 

We say that a form /(X;,..., X,) represents an element a of k if there 
exists x ¢k", x # 0, such that f(x) = a. In particular f represents 0 if and 
only if the corresponding quadratic module contains a non-zero isotropic 
element. 


Proposition 3’.—/f f represents 0 and is nondegenerate, one has f ~ f. +8 
where f, is hyperbolic. Moreover, f represents all elements of k. 
This is a translation of prop. 3 and its corollary. 


Corollary 1.—Let g = g(X,;,...,X,-,) be a nondegenerate quadratic 
form and let ae k*. The following properties are equivalent: 


(1) g represents a. 
(ii) One has g ~h+aZ? where h is a form in n—2 variables. 
(iii) The form f = g—aZ* represents 0. 


It is clear that (ii) > (i). Conversely, if g represents a, the quadratic 
module V corresponding to g contains an element x such that x.x = a; if H 
denotes the orthogonal complement to x, we have V = H @ kx, hence 
g ~h+aZ? where h denotes the quadratic form attached to a basis of H. 

The implication (ii) => (iii) is immediate. Finally, if the form f = g--aZ? 
has a nontrivial zero (x,,...,X,—- 1, Z) we have either z = 0 in which case 
g represents 0 thus also a, or z + Oin which case g(x,/z,...,X,—,/Z) = @. 
Hence (iii) = (i). 


Corollary 2.—Let g and h be two nondegenerate forms of rank 2\, and 
let f = g—h. The following properties are equivalent: 


(a) f represents 0. 
(b) There exists ae k* which is represented by g and by h. 
(c) There exists ae k* such that g—aZ? and h—aZ? represent 0. 


The equivalence (b) <(c) follows from corollary 1. The implication 
(b) = (a) is trivial. Let us show (a) > (b). A nontrivial zero of f can be 
written in the form (x, y) with g(x) = A(y). If the element @ = g(x) = A(y) 
is + O, it is clear that (b) is verified. If a = 0, one of the forms g for example, 
represents 0, thus all elements of k, and in particular all non-zero values 
taken by Ah. 


Theorem 1 translates into the classical decomposition of quadratic 
forms into "sums of squares”: 
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Theorem 1'.—Let f be a quadratic form in n variables. There exists 
Q;,...,@, €k such that f ~a,X}?+...+a,X). 

The rank of fis the number of indices i such that a; + 0. It is equal to 
nif and only if the discriminant a, ...a, of fis #0 (in other words, if f is 
nondegenerate). 

Finally the corollary to Witt’s theorem gives the following ‘“‘cancellation 
theorem’’: 


Theorem 4.—Let f = g+handf’ = g' +h’ be two nondegenerate quadratic 
forms. If f ~f' and g ~ g’, one hash ~h'. 


Corollary.—If f is nondegenerate, then 
fret... +g,+h 
where g,,..., 8, are hyperbolic and h does not represent 0. This decomposition 
is unique up to equivalence. 
The existence follows from prop. 3’, and the uniqueness from theorem 4. 


{The number m of hyperbolic factors can be characterized as the dimen- 
sion of the maximal isotropic subspaces of the quadratic module defined by f/] 


1.7. Quadratic forms over F, 


Let p be a prime number #2 and let g = p/ a power of p; let F, be a 
field with g elements (cf. chap. I, §1). 


Proposition 4.—A quadratic form over F, of rank 22 (resp. of rank 23) 
represents all elements of Ff (resp. of F,). 

In view of cor. 1 of prop. 35 it suffices to prove that all quadratic forms in 
3 variables represent 0 and this has been proved in chap. I, §2, as a con- 
sequence of Chevalley theorem. 

[Let us indicate how one can prove this proposition without using 
Chevalley theorem. One has to show that, if a, b, ce F, are not zero, the 
equation 

(*) ax?+by? =c 


has a solution. Let A (resp. B) be the set of elements of F, of the form ax? 
(resp. of the form c—by’) with x ¢F, (resp. with y ¢ F,). One sees easily 
that A and B have each (¢+1)/2 elements; thus A A B + @ from which one 
gets a solution of (*).] 

Recall now that the group FF has two elements (chap. I, n° 3.1). 
Let a denote an element of FY which is not a square. 


Proposition 5.—Every nondegenerate quadratic form of rank n over F, is 
equivalent to 
Xi+...4+X7_, +X? 
or 
X24...4X2_,4+aX?2, 


depending on whether its discriminant is a square or not. 
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This is clear ifm = 1. Ifn = 2, prop. 4 shows that the form f represents 1. 
It is thus equivalent to X¥?+g where g is a form in n—1 variables and one 
applies the inductive hypothesis to g. 


Corollary.—For two nondegenerate quadratic forms over F, to be equivalent 
it is necessary and sufficient that they have same rank and same discriminant. 

(Of course the discriminant is viewed as an element of the quotient 
group F*/F*?.) 


§2. Quadratic forms over Q, 


In this paragraph (n° 2.4 excepted) p is a prime number and k is the 
p-adic field Q,. 

All quadratic modules are over k and nondegenerate; we make the same 
conventions for the quadratic forms. 


2.1. The two invariants 


Let (V, Q) be a quadratic module of rank n and d(Q) its discriminant; 
it is an element of k*/k*?, cf. n° 1.1. If e = (e,,...e,) is an orthogonal 
basis of V, and if we put a; = e;.e,;, we have 


d(Q) =a,...a, (in k*/k*?) 


(In what follows we will often denote by the same letter an element of k* 
and its class modulo k*?.) 

Recall now that, if a and 5 are elements of k*, we have defined in chap. 
III, n° 1.1, the Hilbert symbol (a, 6), equal to +1. We put 


e(e) = I] (a;, a,). 


One has e(e) = +1. Moreover e(e) is an invariant of (V, Q): 


Theorem 5.—The number «(e) does not depend on the choice of the 
orthogonal basis e. 

If n = 1, one has e(e) = 1. If m = 2, one has e(e) = 1 if and only if 
the form Z?—a,X*—a,Y? represents 0, that is to say (cf. cor. 1 to prop. 3’) 
if and only if a,X*+a,Y? represents 1; but this last condition signifies that 
there exists v « V such that Q(v) = | and this does not depend on e. For 
n 2 3 we use induction on n. By th. 2 it suffices to prove that e(e) = e(e’) 
when e and e’ are contiguous. In view of the symmetry of the Hilbert symbol, 
e(e) does not change when we permute the e,; we can thus suppose that 
e’ = (e;...,e,) is such that e, = e,. If we put a; = e}.e;, then aj = a. 
One can write e(e) in the form 


e(e) = (a,,a,...a,) |] (4; 4;) 


2Si<j 
= (a,4(Q)a) JT (aya) 


since d(Q) = a,...4,. 
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Similarly 


e(e’) = (a, d(Q)ay) TT (ai, a) 


But the inductive hypothesis applied to the orthogonal complement of e, 
shows that 


I] @.a)= [] G, 4), 
2si<j 2si<j 
from which the desired result follows. 
We write from now on e¢(Q) instead of e(e). 


Translation.—If f is a quadratic form in n variables and if 


f~a,XP+...+a,X?2, 
the two elements 
d(f) =a,...a, (in k*/k*?) 
e(f) = [] G, a;) (in {+1 }) 
i<j 


are invariants of the equivalence class of /. 


2.2. Representation of an element of k by a quadratic form 


Lemma.—a) The number of elements in the ¥F,-vector space k*/k*? is 
2’ with r = 2 if p + 2 andr = 3 if p = 2. 

b) If ae k*/k*? and « = +1, let H® be the set of x ¢k*/k*? such that 
(x, a) = «. Ifa =1, H} has 2" elements and HZ! = @. Ifa + |, HE has 
2°~! elements. 

c) Leta,a’ ¢k*/k** ande,e' = +1; assume that Hé and H® are nonempty. 
For H' (\ H'. = @, it is necessary and sufficient that a = a’ ande = —e'. 

Assertion a) has been proved in chap. II, n° 3.3. In b) the case a = 1 
is trivial; of a 4 1, the homomorphism b+ (a, b) carries k*/k*? onto {+1} 
(chap. III, n° 1.2, th. 2); its kernel H! is thus a hyperplane of k*/k*? and 
has 2”~' elements; its complement H;' has 2’~' elements (it is an ‘‘affine”’ 
hyperplane parallel to H!). Finally, if Hé and H‘ are nonempty and disjoint, 
they have necessarily 2’~' elements each and are complementary to one 
another; this implies H) = H}, hence 


(x, a) = (x,a’) forall x ek*/k*?; 
since the Hilbert symbol is nondegenerate, this implies a = a’ and e = —e’. 
The converse is trivial. 
Let now f be a quadratic form of rank n; let d = d(f) and e = e(f) be 
its two invariants. 


Theorem 6.—For f to represent O it is necessary and sufficient that: 


i)n =2andd= —1 (in k*/k*?), 
li) n = 3 and(—1, —d) =e, 
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il) n = 4 and either d + 1 ord = | ande = (—1, —1). 

iv) n2 5 

(In particular, all forms in at least S variables represent 0.) 


Before proving the theorem, let us indicate a consequence of it: let 
aek*/k** and f, = f—aZ?. We know (cf. n° 1.6) that f, represents 0 if and 
only if f represents a. On the other hand, 


Af.) = —ad, e(f,) = (-a, d)e, 
as one checks right away. By applying theorem 6 to f,, and taking into 
account the above formulas, we obtain: 
Corollary. Let a « k*/k**. In order that f represent a it is necessary and 
sufficient that: 


1) n= 1 anda = d, 
ll) n = 2 and (a, —d) = e, 
lil) n = 3 and eithera + —dora = —dand(-—1, —d) =e, 
iv) n 2 4. 
(Note that, in this statement as in theorem 6, a and d are viewed as 
elements of k*/k*?; the inequality a + —d means that a is not equal to the 
product of —d by a square.) 


Proof of theorem 6.—We write f in the form f ~ a,X?7+...+a,X, and 
consider separately the cases n = 2, 3, 4 and 25. 


1) The case n = 2. 
The form f represents 0 if and only if —a,/a, is a square; but —a,/a, = 
—a,a, = —d in k*/k*?; hence this means that d = —1. 


li) The case n = 3. 
The form f represents 0 if and only if the form 


2 2 2 
—a,f ~ —ajza,X; —a,a,Xz— X; 


represents 0. Now by the very definition of the Hilbert symbol, this last form 
represents 0 if and only if we have 


(—@3a,, —a3Q>) = |, 
Expanding this, we find: 


(-1, - 1)(-1, a,)(- I, a,)(a;, a,)(a,, a,)(a,, a;)(a, a3) =] 
But one has (a;, a,)=(—1, a3), cf. chap. III, n° 1.1, prop. 2, formula iv). 
One can thus rewrite the above condition in the form 
(-1, - 1) (- l, a,a243) (a, a>) (a,, a3) (a, a3) =I, 
or (—1, —d)e = I, i.e. (-1, —d) =e. 


ili) The case n = 4. 
By cor. 2 to prop. 3’, f represents 0 if and only if there exists an element 
x €k*/k*? which is represented by the two forms 
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a,X/+a,X} and —a,X}-—a,X}. 
By case 11) of the corollary above, such an x is characterized by the conditions 
(x, —@,@2) = (ay, a2) and (x, —a3a4) = (—43, —ay). 


Let A be the subset of k*/k*? defined by the first condition, and let B be the 
subset defined by the second. In order that f does not represent 0, it is neces- 
sary and sufficient that AN B= 2. Now A and B are clearly nonempty 
(one has a, ¢ A and —a, « B for example). By part c) of the lemma given at 
the beginning of this n°, the relation AM B = @ ts thus equivalent to 


QQ, = A304 and (a,,@,) = —(—a3—4,). 
The first condition means that d = ]. If it is fulfilled one has 


€ = (Qj, Az) (43, A4) (A304, 2304); 


by using the relation (x, x) = (—1, x) (cf. chap. TI, n° 1.1, formula iv) of 
prop. 2) we get from this: 


€ = (@,, 42) (a3, a4) (—1, @344) 
= (a, 42) (—a3, a,)(—1, —1). 


Hence the second condition can be written e = —(—1, —1), from which the 
result follows. 


iv) The case n 2 5. 

It is sufficient to treat the case n = 5. By using the lemma and part ii) 
of the above corollary, we see that a form of rank 2 represents at least 2’~ ' 
elements of k*/k**, and, the same is a fortiori true for the form of rank =2. 
Since 2’"' = 2, f represents at least one element a ¢ k*/k*? distinct from d. 
One has 

f ~aX* +8, 
where g 1s a form of rank 4. The discriminant of g is equal to d/a; it is thus 
different from I, and, by i), the form g represents 0. The same is then true 
for f, and the proof of theorem 6 is complete. 


Remarks.—1) Let f be a quadratic form not representing 0. The above 
results show that the number of elements of k*/k*? which are represented 
by fis equal to | if = 1, to 2’"' ifn = 2, to 2’—1 if m = 3, and to 2' if 
n = 4, 

2) We have seen that all quadratic forms in 5 variables over Q, represent 
0. In this connection, let us mention a conjecture made by E. Artin: al/ 
homogeneous polynomials of degree d over Q, in at least d* +1 variables have 
a nontrivial zero. The case d = 3 has been solved affirmatively (see, for 
example, T. SPRINGER, Koninkl. Nederl. Akad. van Wetenss., 1955, pp. 512- 
516). The general case remained open for about thirty years. It was only in 
1966 that G. TERJANIAN showed that Artin’s conjecture is false: there exists 
a homogeneous polynomial of degree 4 over Q, in 18 variables which has no 
nontrivial zero. Terjanian starts from the polynomial 
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n(X, Y, Z) = X?YZ+Y?ZX4Z7°XV4+X7Y74+Y°Z74+Z7X? — X4*-Y*-Z?4, 


which has the property that n(x, y, z) = —1 (mod 4) if (x, y, z) is primitive 
in (Z,)>. Let 


I(X,, «2, Xg) = m(X,, Xz, X3) + (Ng, Xs, Xo) + (X37, Xs, Xo); 


one has f (x,,..., X9) = 0 (mod 4) if (x,,..., x9) 1S primitive. From this 
one deduces easily that the polynomial 


F(X, .- +5 X18) =/(%,~.«, Xo) +4Y(MNi05- - +s X18) 


does not have a non-trivial zero. (There exist analogous examples—but of 
higher degrees—for all the Q,.) 

One knows nevertheless that Artin’s conjecture is ‘‘almost” true: for 
a fixed degree d, it holds for all prime numbers p except a finite number 
(Ax-KOCHEN, Amer. J. of Math., 1965); however, even for d = 4, one does 
not know how to determine the set of exceptional prime numbers. 


2.3. Classification 


Theorem 7.—7Two quadratic forms over k are equivalent if and only if they 
have the same rank, same discriminant, and same invariant e. 

That two equivalent forms have the same invariants follows from the 
definitions. The converse is proved by induction on the rank n of the two 
forms f and g considered (the case n = 0 being trivial). Corollary to theorem 6 
shows that f and g represent the same elements of k*/k**; one can thus 
find a e k* which is represented at the same time by f and by g; this allows 
One to write: 

f ~aZ’?+f' and g ~aZ* +g’, 
where f’, g’ are forms of rank n—1. One has 
df’) = ad(f) = ad(g) = d(g’) 
e(f") = e(f) (a, af") = e(g) (a, d(g’)) = e(g’), 


which shows that f’ and g’ have the same invariants. In view of the inductive 
hypothesis, we have f’ ~ g’, hence f ~ g. 


Corollary.—Up to equivalence, there exists a unique form of rank 4 


which does not represent 0; if (a, b) = —1, it is the form z* —ax* — by? +abt?. 
Indeed, by th. 6, such a form is characterized by 


d(f) = 1,e€f) = -(-1,-)) 


and a simple computation shows that z?—ax*—by*+abt* has these pro- 
perties. 


Remark.—This form is the reduced norm of the.unique non-commutative 
field of degree 4 over Q,; this field can be defined as a field ‘tof quaternions” 
with basis {1, i, j,k} where i? = a,j? = b,ij = k = —ji, and (a, b) = —1. 
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Proposition 6.—Let n 2 1, d¢k*/k*? and « = +1. In order that there 
exists a quadratic form f of rank n such that d(f) = d and &f) = « it is 
necessary and sufficient thatn = 1,e = ljorn=2,d + —ljorn=2,¢=1; 
orn 2 3. 

The case n = | is trivial. If n = 2, one has f~aX?+6Y? and, 


if d(f)= —1, then e(f) = (a, 6) = (a, —ab) = 1; 


thus we cannot have simultaneously d(/) = —1 and e(f) = —1. Con- 
versely, if d= —l,e= 1, we take f= X¥?- Y’; if d # —1, there exists 
aek* such that (a, —d) = e and we take f = aX*+adY’. 

If n = 3, we choose a ¢ k*/k*? distinct from —d; by what we have just 
seen, there exists a form g of rank 2 such that d(g) = ad, e(g) = e(a, —d); 
the form aZ?+g works. The case n 2 4 is reduced to the case n = 3 by 
taking f = 2(X,, X,, X3)+X2+...+X2 where g has the required invariants. 


Corollary.—The number of classes of quadratic forms of rank n over 
Q, for p + 2 (resp. p = 2) is equal to 4 (resp. 8) ifn = 1, to 7 (resp. 15) if 
n = 2, and to 8 (resp. 16) ifn 2 3. 

Indeed, d(f) can take 4 (resp. 8) values and e(f) can take 2 values. 


2.4. The real case 


Let f be a quadratic form of rank n over the field R of real numbers. 
We know that f is equivalent to 


X? +. . +X7- y?_. . .— ¥2, 


where r and s are two integers 20 such that r+s = n; the pair (r, s) depends 
only on f; it is called the signature of f; We say that fis definite if r or s = 0, 
ie. if f does not change sign; otherwise, we say that f is indefinite. (This is 
the case where / represents 0.) 

The invariant e(f) is defined as in the case of Q,; due to the fact that 
(-—1, —1) = —1, we have 


l if s = 0, 1 (mod 4) 
—_ —| s(s—1)/2 — ’ 
(fy =(-) —1 if s = 2, 3 (mod 4). 
Moreover: 
1 if s = 0 (mod 2 
df) =(-yr =) 1S = 0 mod?) 
—1 ifs = 1 (mod 2). 
We see that the knowledge of d(f) and e(f) is equivalent to that of the 
class of s modulo 4; in particular, d({) and e(f) determine f up to equivalence 
ifn s 3. 
One also checks that parts i), ii), iii) of th. 6 and its corollary are valid 
for R (indeed their proofs use only the nondegeneracy of the Hilbert symbol, 
and this applies to R); it is clear that part iv) does not extend. 
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§3. Quadratic forms over Q 


All quadratic forms considered below have coefficients in Q and are 
nondegenerate. 


3.1. Invariants of a form 


As in chap. III, §2, we denote by V the union of the set of prime numbers 


and the symbol oo, and we put Q,, = R. 
Let f ~a,X/7+...+a,X? be a quadratic form of rank n. We associate 


to it the following invariants: 


a) The discriminant d(f) ¢ Q*/Q*? equal to a, ...a,. 

b) Let ve V. The injection Q —> Q, allows one to view f as a quadratic form 
(which we will denote f,) over Q,. The invariants of f, will be denoted by 
d,(f) and e,(/); it is clear that d,(f) is the image of d(f) by Q*/Q*? >Q*/Q*?; 


we have 
ef) = Il@, a;)y. 


i<j 
The product formula (chap. III, n° 2.1, th. 3) gives the relation 
[]e(f) = 1. 


veV 


c) The signature (r, s) of the real quadratic form f is another invariant of / 


The invariants d,(f), «,(f), and (r, s) are sometimes called the local 
invariants of f. 


3.2. Representation of a number by a form 


Theorem 8 (Hasse-Minkowski).—Jn order that f represent 0, it is necessary 
and sufficient that, for all ve V, the form f, represent 0. 
(In other words: f has a “global” zero if and only if f has everywhere a 
“*local’’ zero.) 
The necessity is trivial. In order to see the sufficiency, we write f in the 
form 
f=a,X7+...+a,X7, a; ¢Q*. 


Replacing f by a,f, one can moreover suppose that a, = 1. We consider 
separately the cases n = 2, 3, 4 and 25. 


1) The case n = 2. 
We have f = X?—aX}; since f,, represents 0, a is >0. If we write a in 


the form 
a= [[ pb’, 
Pp 


the fact that f, represents 0 shows that a is a square in Q,, hence that v,(a) is 
even. From this follows that a is a square in Q and / represents 0. 
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li) The case n = 3 (Legendre). 

We have f = X?—aX}—bX}?; being free to multiply a, 6, by squares, 
we can assume that a and b are square free integers (i.e. v,(a), v,(b) are equal 
to 0 or | for all prime numbers p). Also we can assume that |a| < |b]. We 
use induction on the integer m = |a|+ |b]. If m = 2, we have 


f= XPAXPE XG; 


the case of 7+ X74 X? is excluded because f,, represents 0; in the other 
cases, f represents zero. 
Suppose now that m > 2, i.e. |b] 2 2 and write b in the form 


b= +p,..- Dp, 


where the p; are distinct primes. Let p be one of the p;; we are going to 
prove that a is a square modulo p. This ts obvious if a = 0 (mod p). Otherwise 
a is a p-adic unit; by hypothesis, there exists (x, y, z) €(Q,)* such that 
z*—ax*—by? = 0 and we can suppose that (x, y, z) is primitive (cf. chap. II, 
n° 2.1, prop. 6). We have z7—ax? = 0 (mod p). From this follows that, if 
x = 0 (mod p), the same is true also for z, and by? is divisible by p*; since 
v,(b) = 1 this implies y = 0 (mod p) contrary to the fact that (x, y, z) 1s 
primitive. Thus we have x = 0 (mod p), which shows that a is a square 
(mod p). Now, since Z/bZ = I1Z/p;Z, we see that a is a square modulo b. 
There exist thus integers ¢, b’ such that 


t? = a+bb' 


and we can choose ¢ in such a way that |r| < [6|/2. The formula bb’ = t?—a 
shows that bb’ is a norm of the extension k(/a)/k where k = Q or Q;; 
from this we conclude (the argument is the same as that for prop. | of chap. 
III), that f represents 0 in k if and only if the same is true for 


f' = X?—aX}—b'X}. 
In particular, f’ represents 0 in each of the Q,. But we have: 


|O| 


t?—a 
——|s 7 + 1 < |b] because |b] 2 2. 


[b'| = 


Write 6’ in the form 6b’u* with 6b”, u integers and 6b” square free; we 
have a fortiori |b"| < |b]. The induction hypothesis applies thus to the 
form f” = X?—aX}—b’X? which is equivalent to f’; hence this form 
represents 0 in Q and the same ts true for / 


il) The casen = 4. 

Write f = aX7+bX}—-(cX?+dX}7). Let ve V. Since f, represents 0, 
cor. 2 of prop. 3’ of n° 1.6 shows that there exists x, ¢ Q* which is represented 
both by aX¥?+bX? and by cX?+dX}; by part ii) to cor. of th. 6 (which 
applies equally well to Q,, = R), this is equivalent to saying that 


(x,, —ab), = (a,b), and (x,, —cd), = (c,d), forallue V. 
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Since W (a, 6), = W (c, d), = 1, we can apply th. 4 of chap. III, n° 2.2 and 
obtain from it the “existence of x « Q* such that 
(x, —ab), = (a,b), and (x, —cd), = (c, d), for all ve V. 


The form aX?+5X}—~xZ? represents zero in each of the Q, hence in Q by 
what we have just seen. Hence x is represented in Q by aX;?+bX}, and the 
same argument applies to c¥;7+dX?; the fact that f represents 0 follows 
from this. 


lv) The case n 2 5. 
We use induction on n. We write f in the form 


f=h-g 


with h = a,X?+a,X}, g = —(a,X7+....4+a,X?). 

Let S be the subset of V consisting of 00, 2, and the numbers p such that 
v,(a;) + 0 for one i 2 3; it is a finite set. Let v e S. Since f, represents 0, 
there exists a, ¢ Q* which is represented in Q, by / and by g; there exist 
x? €Q,, i= 1,...,”, such that 


A(x], X3) = Ay = B(X3, - . - » Xp) 


But the squares of Q* form an open set (cf. chap. II, n° 3.3). Using the 
approximation theorem (chap. III, n° 2.2, lemma 2), this implies the existence 
of x;, x, €Q such that, if a = h(x,, x2), one has a/a, « Q*? for all ve S. 
Consider now the form /, = aZ’?~g. If v ¢ S, g represents a, in Q,, thus also 
a because a/a, « Q**; hence f, represents 0 in Q,. If v ¢ S, the coefficients 
—A3,..., ~—a, Of g are v-adic units; the same is true of dg), and because 
v # 2 we have e,(g) = 1. [This could also be deduced from cor. 2 to th. | of 
chap. II, n° 2.2. combined with Chevalley’s theorem.] In all cases, we 
see that f, represents 0 in Q,; since the rank of f, is n—1, the inductive 
hypothesis shows that /, represents 0 in Q, i.e. g represents a in Q; since h 
represents a, f represents 0, and the proof is complete. 


Corollary 1.—Let a < Q*. In order that f represents a in Q it is necessary 
and sufficient that it does in each of the Q.,,. 
This follows from the theorem applied to the form aZ?--f. 


Corollary 2 (Meyer).—A quadratic form of rank =5 represents 0 if and 
only if it is indefinite (i.e. if it represents 0 in R). 
Indeed, by th. 6, such a form represents 0 in each of the Q,. 


Corollary 3.—Let n be the rank of f. Suppose that n = 3 (resp. n = 4 
and df) = 1.) If f represents 0 in all the Q, except at most one, then f repre- 
sents Q. 

Suppose that n = 3. By th. 6, f represents 0 in Q, if and only if we have: 


(*), (-1, -d(f)), = e(f). 
But the two families «,(f),(—1, —d(/)), satisfy the product formula of chap. 
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III, n° 2.1. From this follows that, if (*), is true for all v except at most one 
(*), 1S true for all v; by th. 8, f represents 0. 

When n = 4 and d(/) = 1 we argue in the same way, the equality (x), 
being replaced by (—1, —1), = ¢,(/). 


Remarks.—\) Suppose that n = 2 and that f represents 0 in all the Q, 
except a finite number. One can then show, by means of the theorem on 
arithmetic progressions (cf. chap. VI, n° 4.3) that f represents 0. 

2) Th. 8 does not extend to homogeneous polynomials of degree > 3; 
for example, Selmer has proved that the equation 


3V¥374+4Y°4+5Z° =0 


has a nontrivial solution in each of the Q, but none in Q. 
3.3. Classification 


Theorem 9.—Ler f and f’ be two quadratic forms over Q. For f and f’ to 
be equivalent over Q it is necessary and sufficient that they are equivalent 
over each Q,. 

The necessity is trivial. To prove the sufficiency, we use induction on 
the rank 7 of f and f’. If n = 0, there is nothing to prove. Otherwise, there 
exists a € Q* represented by f, thus also by /’ (cf. cor. 1 to th. 8). Thus we 
have f ~aZ*+g, f' ~ aZ* +g’. By th. 4 of n° 1.6, we have g ~ 8" over 
Q, for all v ¢ V. The induction hypothesis then shows that g ~ g’ over Q, 
hence f ~ f”. 


Corollary.—Let (r, s) and (r’, s‘) be the signatures of f and of f'. For f and 
J’ to be equivalent it is necessary and sufficient that one has 


Af)= af), (8) = (r',5'), ande(f) = (f’) forall ve V. 


Indeed these conditions just mean that f and f’ are equivalent over each 
of the Q.,. 


Remark.—The invariants d = d(f), ¢, = ¢,(f) and (r, s) are not arbitrary. 
They verify the following relations: 


(1) «, = 1 for almost all v ¢ V and Il e, = 1, 
veV 


(2) «, = lifn = l orifnm = 2 and if the image d, of d in Q*/Q*? is equal 
to —l, 
(3) r,s 2 Oandr+s =n, 


(4) d,. =(-Il), 
(9) e = (- 9 
Conversely: 


Proposition 7.—Let d, (e,),<y, and (r, s) verify the relations (1) to (5) 
above. Then there exists a quadratic form of rank n over Q having for invariants 
d, (e,)vev and (r, 5). 

The case n = | 1s trivial. 
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Suppose that n = 2. Let v e V. The nondegeneracy of the Hilbert symbol, 
together with condition (2), shows that there exists x, ¢Q*® such that 
(x,, —d), = €. From this and condition (1), follows the existence of x « Q* 
such that (x, —d), = «, for all ve V (chap. III, n° 2.2, th. 4). The form 
xX?+4+xdY* works. 

Suppose that n = 3. Let S be the set of v « V such that (—d, —1), = —.,; 
it is a finite set. If v ¢ S, choose in Q*/Q*? an element c, distinct from the 
image —d, of —d in this group. Using the approximation theorem (chap. 
III, n° 2.2, lemma 2), we see that there exists c ¢ Q* whose image in each 
of the Q*/Q*?, vc S, is c,. From what we have just proved follows the 


v > 


existence of a form g of rank 2 such that 
d(g) = cd, ¢,(g) = (c,—d),e, forallve V. 


The form f = cZ? +g then works. [Note that for n $ 3 we do not need to 
consider the signature of the form, since conditions (3), (4), (5) determine 
it as a function of d,, and «,,]. 

When n 2 4 we use induction on n. Suppose first that r 2 1. Using 
the inductive hypothesis, we obtain a form g of rank n—1 which has for 
invariants d, (¢,),-y and (r—1, s); the form X*+g works. When r = 0, 
we use a form A of rank n—1 having for invariants —d, ¢,(—1, —d),, and 
(0, n—1); the form — X* +h works. 


Appendix 
Sums of three squares 


Let n and p be positive integers. We say that n is the sum of p squares 
if n is representable over the ring Z by the quadratic form X7+.. +X, 
le. if there exist integers n,,...,, such that 


n=nit+.. +r 


Theorem (Gauss).—/n order that a positive integer be a sum of three 
squares it is necessary and sufficient that it is not of the form 4°(8b—1) with 
a,beZ. 

(Example: if n is not divisible by 4, it is a sum of three squares if and only 
ifn = 1, 2, 3, 5, 6 (mod. 8).) 


Proof.—We can suppose n is nonzero. The condition ‘‘n is of the form 
4°(8b—1)” is then equivalent to say that —n is a square in Q} (chap. I, 
n° 3.3, th. 4). But we have: 


Lemma A.—Let a €« Q*. In order that a be represented in Q by the form 
f = X?+X2+X}3 it is necessary and sufficient that a is >0 and that —a is 
not a square in Q,. 

By cor. | of th. 8 we have to express that a is represented by f in R and 
in all Q,. The case of R gives the positivity condition. On the other hand the 
local invariants d,(f) and «,(f) are equal to |. If p + 2, one has 
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(- l, —d{(f)), = (- I, _ 1), =|= ent); 


the corollary of theorem 6 thus shows that a is represented by f in Q,. 
If p = 2, we have (-—1, —d,(f)), = —I + e,(f); the same corollary shows 
that a is represented by f in Q, if and only if a is different from —I in 
Q*/Q%?, i.e. if —a@ is not a square in Q,. 


Now we must pass from representations in Q to representations in Z. 
This is done by means of the following lemma: 


Pp 
Lemma B (Davenport-Cassels).—Let f(X) =. 2% aX ;X; be a positive 
eS et 


definite quadratic form, the matrix (a;;) being symmetric and with integer 
coefficients. We make the following hypothesis: 


(H) For every x = (x,,...,X,) € Q? there exists ye Z? such that 
f(x-y) <1. 

If n € Z is represented by f in Q, then n is also represented by f in Z. 

If x = (x,,...,X,) and y = ();,..., yy) are two elements of Q’, we 


denote by x.y their scalar product 2a; ;,x,y;. One has x.x = f(x). 

Let n be an integer represented by f in Q. There exists an integer t > 0 
such that ¢?7n = x.x with x « Z?. Choose t and x in such a way that ¢ is 
minimum; we must prove that t = |. 

By hypothesis (H), there exists y « Z? such that 


=y+z withz.z <I. 


If zz = 0 we have z = 0 and ~ has integer coefficients. Because of the 
f 


minimality of ¢, this implies ¢ = 1. 
Assume now that z.z + 0 and put 


a= yy-n 

b = 2(nt—x.y) 
t' = at+b 

x’ = ax+by. 


One has a, b, t' « Z, and: 

xx’ = a*x.x+2abx.y+b*yy 
a*t?n + ab(2nt —b)+b?(n+a) 
n(a?t? + 2abt + b?) 


= t'*n, 


Moreover: 
at?+bt = t?y.y—nt? +2nt? —2t x.y 


— 

~ 
~ 

Il 


t?y.y—2t x.ytxx = (ty—x).(ty—x) 


= 77,2, 
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hence t’ = ¢ z.z; since 0 < z.z < 1, we have 0 < t’ < t. This contradicts the 
minimality of ¢ and concludes the proof of the lemma. 


To prove the theorem, it suffices now to check that the form f = X/7+ 
X} +X? satisfies condition (H) of lemma B. But this is clear: if (x,, x2, x3) € 
Q* we choose (y,, ¥2, ¥3) € Z°* such that |x;—y,| S$ 4 for all i; we have 
X(x;—y,)? S 3/4 < 1. 


Corollary 1 (Lagrange).— Every positive integer is a sum of four squares. 

Let n be an integer >0. We write n in the form 4°m, where m is not divis- 
ible by 4. If m = 1, 2, 3, 5, 6 (mod 8), m is a sum of three squares, and the 
same holds for n. Otherwise m = —1 (mod 8) and m—1 is sum of three 
squares; in this case m is a sum of four squares, and the same holds for n. 


Corollary 2 (Gauss).—FEvery positive integer is a sum of three triangular 
numbers. 

(A number is called “triangular” if it is of the form m(m+1)/2 with 
me Z,) 

Let n be a positive integer. By applying the theorem to 8n+3, we see 
that there exist integers x,, x,, x3; such that 


xe + xt4+x32 = 8n43. 
One has 
xi+3x3+x2 = 3 (mod 8). 


But the only squares in Z/8Z are 0, 1 and 4; a sum of three squares in 
Z/8Z can be equal to 3 only if each of its terms equal 1. This shows that the 
x, are odd, and one can write them in the form 2m,;+1 with m, an integer. 
We have 


> m(m,+1) _ 1 (‘< 


( Qm,+1)?-3) = - (8n+3-—3) =n. 


] 1 
i=1 2 8 \i= 8 


Chapter V 


Integral Quadratic Forms with Discriminant + | 


§1. Preliminaries 


1.1. Definitions 


Let n be an integer 20. We are interested in the following category S,: 

An object E of S, is a free abelian group of rank n (i.e. isomorphic to Z") 
together with a symmetric bilinear form E x E — Z, denoted by (x, y) x.y, 
such that: 

(i) The homomorphism of E into Hom(E, Z) defined by the form x.y is an 
isomorphism. 

One sees easily that this condition is equivalent to the following (cf. BOURBAKI, 
Alg., chap. IX, §2, prop. 3): 

(il) Jf (e,) is a base of E, and if a;; = e.e;, the determinant of the matrix 
A = (a;;) is equal to +1. 

The notion of isomorpnism of two objects E, E’ € S, is defined in an ob- 
vious way. One then writes E ~ E’. It is also convenient to introduce 
S=vUS,,n0 =0,1,... 

If Ee S,, the function x» x.x makes E a quadratic module over Z (cf. 
chap. IV, def. 1, n° 1.1). If (e;) is a basis of E and if x = Xx,e;, the quadratic 
form f/(x) = x.x is given by the formula 


f(x) = Di jXiX; with a;; = €;.e; 
i,J 


= ¥ ajjx7+ 2D a; ;x;x;. 
i i<j 
The coefficients of the non-diagonal terms are thus even. The discriminant 
of f (i.e. det(a;;)) is equal to + 1. Changing the basis (e;) means replacing the 
matrix A = (a;;) by ‘BAB with B « GL(n, Z). From the point of view of the 
form f, this means changing the variables (x;) by the linear substitution 
with matrix B; the form so obtained is called equivalent to the form f. (Ob- 
serve that this is an equivalence over the ring Z of integers; it is finer than the 
equivalence over Q studied in the preceding chapter.) 


1.2. Operations on S 


Let E, E’ eS. We denote by E@ E’ the direct sum of E and of E' 
together with the bilinear form which is the direct sum of those on E and E’; 
by definition (cf. BoURBAKI, Alg., chap. IX, § 1, n° 3): 

(x+x').(yty’) = xytx'y’  ifx, ye Eandx',y’eE’. 
48 
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From the point of view of “quadratic forms” this operation corresponds to 
that of orthogonal direct sum denoted by @ in chap. IV. 

One can also define tensor products E@® E' and exterior powers A"E 
(BouRBAKI, /oc cit., n° 9); we will not need them. 


1.3. Invariants 


1.3.1. If Ee S,, the integer n is called the rank of E and is denoted by r(E). 


1.3.2. Let Ee Sand let V = E @ R the R-vector space obtained by extending 
the scalars from Z to R. The quadratic form of V has a well defined signature 
(r, s) (chap. IV, n° 2.4). The integer 


7(E) =r-s 
is called the index of E. One has 
—r(E) S$ (E)S r(E) and -(E) = 7(E)(mod 2). 


Recall that E is called definite if 7(£) = +r(E), ie. if x.x has constant 
sign; otherwise E is called indefinite. 


1.3.3. The discriminant of E with respect to a basis (e;) does not depend on 
the choice of this basis. Indeed, changing the basis (e;) multiplies the dis- 
criminant by det(X'X) = det(X)* where X is an invertible matrix over Z: 
the determinant of X is equal to +1, and its square is equal to 1. 

The discriminant of E is denoted by d(E); one has d(E) = +1. If 
V = E @ Riis of Signature (r, 5), the sign of d(E£) is (—1)°; since d(E) = +1, 
we get the formula: 

d(E) —_ (— 1B EZ 

1.3.4. Let Ee S. We say that E ts even (or of type Il) if the quadratic form 
associated with E takes only even values; if A is the matrix defined by a 


basis of E, this amounts to saying that all the diagonal terms of A are even. 
If E is not even, we say that E is odd (or of type 1). 


1.3.5. Let Ee S and let E = E/2E be the reduction of E modulo 2. It is 
a vector space of dimension r(E) over the field F, = Z/2Z. By passage to 
quotient, the form x.y defines on E a form x.y which is symmetric and of 
discriminant +1 = 1. The associated quadratic form x.x is additive: 


(X+H).(F+H) = KK+PptAWyp = KL+P/ 
Thus it is an element of the dual of E. But the bilinear form x.y is non- 


degenerate; it defines an isomorphism of E on its dual. From this we see that 
there exists a canonical element ii ¢ E such that 


“ux = X¥.x forall te E. 
Lifting u to E, we obtain an element ue E, unique modulo 2£, such that 


u.xx =x.x(mod2)  forallxeE. 
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Consider the integer u.u. If we replace u by u+2x, u.u is replaced by 
(u+2x).(u+2x) = uut+4(u.xt+x.x) = u.u(mod 8). 


The image of u.u in Z/8Z is thus an invariant of E; we denote it by o(£). 
If E is of type IH, the form X.x is zero (in other words, x.y is alternating) 
and we can take u = 0, hence o(£) = 0. 


1.3.6. Let p be a prime number, and let V, = E@ Q, be the Q,-vector 
space obtained from E by extending the scalars from Z to Q,. The invariant 
e(V,) = +1 of V, defined in chap. IV, n° 2.1 is a fortiori an invariant of E; 
we denote it by «,(£). One can prove that: 


e(E) = lifp #2 
e,(E) = (—1)/, where j = 1(d(E)4+7r(E)—o(E)—1). 


This is seen by splitting E @ Z, in an orthogonal direct sum of Z,-modules 
of rank 1 (resp. rank 1 or 2) if p + 2 (resp. p = 2). Since we do not use 
these formulas, we leave the details of the verification to the reader (see also 
J. CASSELS, Comm. Math. Helv., 37, 1962, pp. 61-64). 


1.3.7. Let E,, E,¢S and E = E, @ Ej. In order that E is of type II it is 
necessary and sufficient that E, and E, are of type II. One has: 


r(E) = r(E\)+r(E2), (EL) = r(E,)+7(E2) 
o(E) = of E,)+0(E2), ME) = d(E,).d(E>). 


1.4. Examples 


1.4.1. We denote by /, (resp. /_) the Z-module Z with the bilinear form 
xy (resp. —xy); the corresponding quadratic form is +x? (resp. — x2). 

If s and t are two positive integers, we denote by s/, @ t/_ the direct sum 
of s copies of 7, and ¢t copies of J_; the corresponding quadratic form is 


s 


é 
x x? — x y;. The invariants of this module are: 
j= 


i=] 
r=S+t,7 =s-—t,d =(-—l)',o = s—t(mod 8). 
Aside the trivial case (s, t) = (0, 0), the module s/, @ t/_ is of type /. 


1.4.2. We denote by U the element of S, defined by the matrix (; 0) The 


associated quadratic form is 2x,x,; U is of type II. One has: 
r(U) = 2,7(U) = 0, dU) = —1,0(U) = 0 


1.4.3. Let k be a positive integer, let n = 4k, and let V be the vector 
space Q” with the standard bilinear form 2x,;y; corresponding to the unit 
matrix. Let Ey = Z” be the subgroup of V formed from the points with 
integer coordinates, endowed with the bilinear form induced from that of V; 
Ey is an element of S, isomorphic to nJ,. Let E, be the submodule of E, 
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formed of elements x such that x.x = 0 (mod 2), Le. 2x; = 0 (mod 2). 
One has (E,:£,) = 2. Let E be the submodule of V generated by £, and 
by e = (4,..., 4). One has 2e € E, (since n = 0 (mod 4)) and e ¢ £,, hence 
(E:E,) = 2. For an element x = (x,) of V to belong to E, it is necessary and 
sufficient that one has 


2x,;¢Z, x;-x,;¢Z, } x, ¢22Z. 
iat 


Then we have x.e = 42x; « Z; since e.e = k, this shows that the form x.y 
takes integral values on E. Moreover, the fact that E, has the same index in E, 
and in E shows that the discriminant of E is equal to that of Eo, that 1s to say 
+1. The quadratic module E is thus an element of S, = S4,; we denote it 
by T',. When k is even (i.e. when n = 0 (mod 8)) e.e = k is even and this 
implies that x.x is even for all x ¢ E; T, is thus of type 11 when n = 0 (mod 8). 
One has 


rT gn) = 8m, (Van) = 8m, o(1" gn) = 0, A(T’ gn) = I. 
The case of I, is particularly interesting. There are 240 elements"? x eT’, 


such that x.x = 2; if (e,) denotes the canonical base of Q®, these are the 
vectors 


8 8 
+ete,(@iA#Ak) and 4) ee, ¢,= +1,[]¢ = 1. 
i=1 i=1 
[The mutual scalar products of these vectors are integers; they form what is 
called in Lie group theory a “‘root system of type E,’’ see, BOURBAKI, Gr. et 


Alg. de Lie, chap. VI, §4, n° 10.] 
One can take as a basis of I’, the elements 


4(e, tes) —4(e, +... +0€7),e;+e,, and e;—e,-,(2 $i 7). 


The corresponding matrix 1s 


2 OO -!1 0 0 O0 O 90 
0 2 0 -1 0 0 oO QO 
—1 0 2 -!1 0 0 O 9 
0 -1 -! 2 -1 0 oO 90 
0 0 O -!1 2 —!1 0 90 
0 0 0 90 -!1 2 —-1 0 
“0 0 0 0 O -I 2 —-l 
“0 0 0 0 0 90 -1 2 


For m 2 2 the vectors x ce I’,,, such that x.x = 2 are simply the vectors 
+e,te,(i # k); note that they do not generate I',,,, contrary to what 
happens for m = 1. In particular, [', @ Il, is not isomorphic to I, ¢. 


|) More generally we will show in chap. VII, n° 6.5, that, if N is an integer 21, the 
number of x ¢ I's such that x.x = 2N is equal to 240 times the sum of the cubes of divisors 
of N. 
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1.5. The group K(S) 


Let E, E’ « S. We say that E and £’ are stably isomorphic if there exists 
FeSsuch that E@ F ~ E’ @ F; this is an equivalence relation. We denote 
by K,(S) the quotient of S by this relation and if E « S, we denote by (E) 
the class of E in K,(S). The operation @ defines by passage to quotient a 
composition law + on K,(S); this law is commutative, associative, and 
has for neutral element the class 0 of the module 0 « S. One has 


(E ® E’) = (E)+(E’). 


Moreover, if x, y, z¢K,(S) are such that x+z = y+z, one has x = y; 
the proof is immediate. This allows us to define the group K(S) from the 
semi-group K,(S) (exactly as one defines Z from the set Z, of positive 
integers): By definition, an element of K(S) 1s a pair (x, y) with x, ye K,(S), 
two pairs (x, y). (x’, y’) being identified if and only if x+y’ = y+.x’. The 
composition law of K(S) is defined by 
(x, V+, vy’) = (+2, pty’). 
It makes K(S) into a commutative group with neutral element (0, 0). We 
identify K,(S) with a subset of K(S) by the map x +> (x, 0). Every element of 
K(S) is a difference of two elements of K,(S), thus can be written in the 
form (E)—(F) with E, F = S. One has 
(E)—(F) = (E')—-(F’) in K(S) 

if and only if there exists Ge S such that E@ F’@G~ ELE’ O@FOG, 
i.e. if and only if E @® F’ and E’ @ F are stably isomorphic. 

Universal property of K(S).—Let A be a commutative group and let 
f: SA be a function such that f(£) = /(E,)+/(£,) if Ex~ E, ® E,; 
we then say that f is additive. If x = (E)—(F) is an element of K(S) we put 

K(X) = f(E) AUP); 

this does not depend on the chosen decomposition of x. It is obvious that 
the function f: K(S)— A thus defined is a homomorphism. Conversely, 
every homomorphism /: K(S)— A gives, by composition with S — K(S), 
an additive function on S. We express this “universal” property of K(S) 
by saying that K(S) is the Grothendieck group of S relative to the operation @. 

In particular, the invariants r, 7, d, o of n° 1.3 define homomorphisms 

r: K(S)>Z, 1: K(S)>Z, d: K(S)—> {+41}, o: K(S) > Z/8Z. 


We have again + = r mod 2 and d = (—1)°"/?. 


§2. Statement of results 
2.1. Determination of the group K(S) 


Theorem 1.—The group K(S) is a free abelian group with basis (1) and (1_). 
(The proof will be given in n° 3.4.) 
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In other terms, all f ¢ K(S) can be written uniquely as f = s.(/,)+¢.(/_) 
with s, t¢ Z. One has r(f) = s+t, 7(f) = s—t, which shows that s and f 
are determined by r and 7. From this follows: 


Corollary 1.—The pair (r, +r) defines an isomorphism of K(S) onto the 
subgroup of Zx Z formed of elements (a, b) such that a = b (mod 2). 
Hence: 


Corollary 2.—For two elements E and E' of S to be stably isomorphic 
it is necessary and sufficient that they have same rank and same index. 


{Note that this does not imply E =~ E’ For example U = (; 0} defines 


in K(S) the same element as (5 4) =], ®/_, but U and /, @/._ are 


of different types. ]} 


Theorem 2.—One has o(E) = r(E) (mod 8) for every Ee S. 
Indeed + reduced mod 8, and o, are homomorphisms of K(S) in Z/8Z 
which are equal on the generators /, and /_ of K(S); hence they coincide. 


Corollary 1.—/f E is of type Il, one has 7(E) = 0 (mod 8). 
Indeed o(E) = 0. 
(Note that this implies that r(E) = 0 (mod 2) and d(£) = (—-1)"®/?.) 


Corollary 2.—-/f E is definite and of type II, one has r(E) = 0 (mod 8). 
Indeed we have 7(E) = +r(E£) if E is definite. 


Remarks—1) Conversely, we saw in n° 1.4 that for all m divisible by 8, 
there exists E ¢ S, which is positive definite and of type IT. 

2) The congruence o(£) = 7(E) (mod 8) can also be deduced from the 
product formula Ile(E£) = 1 (see chap. IV, n° 3.1) combined with the 
values of «,( £) given (without proof) in n° 1.3.6. 


2.2. Structure theorems (indefinite case) 


Let Ee S. One says that E represents zero if there exists x e E, x + 0, 
such that x.x = 0. This is equivalent to saying that the corresponding 
quadratic form Q(x) represents 0 over Q in the sense of chap. IV, n° 1.6; 
indeed, one passes from a rational zero to an integral zero by a homothety. 


Theorem 3.—/f Ee¢S is indefinite, E represents zero. 
(The proof will be given in n° 3.1.) 


Theorem 4.—/f E ¢ S is indefinite and of type I, E is isomorphic to sl, ® tl_ 
where s and t are integers 21. 
[The corresponding quadratic form is thus equivalent over Z to the 


form Bade — b> Oi 2] 
(The proof will be given in n° 3.3.) 
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Corollary.—Let E and E' be two elements of S with the same rank and 
index. Then either E@I, ~ E'’@l, on E@I_~E'@O@L.. 

This is clear if E = 0. Otherwise, one of the two modules E @ /, or 
E @ I_ is indefinite. Suppose that the first is. Since E and E’ have the same 
signature, E’ @ /, is also indefinite. By applying theorem 4, we see that 
E@I/, and E’ @ /, areisomorphictos/, @ t/_ and s‘I, @ t’/_ respectively. 
Since E and E’ have the same signature, we have s = s’ and ¢t = ft’, hence the 
result. 


Theorem 5.—/f ES is indefinite of type ll, and if 7(E) 2 0, then E is 
isomorphic to pU ® ql, where p and q are positive integers. 

[When 7(£) < 0, one has a corresponding result obtained by applying 
the theorem to the module-E deduced from E by changing the sign of the 
quadratic form.] 

(The proof will be given in n° 3.5.) 

Note that g = ¢r(£) and p = $(r(E)—7(E£)). This shows that E£ is 
determined up to isomorphism by its rank and its tndex. Since the same is 
true for type I (cf. theorem 4), we get: 


Theorem 6.—/f E, E’ ¢ S are indefinite, and have same rank, index, and 
type, they are isomorphic. 


2.3. The definite case 


One does not have a structure theorem, but only a finiteness theorem: 
for all integers n, S, contains but a finite number of positive definite classes. 
This follows, for instance, from the “reduction theory” of quadratic forms. 
The explicit determination of these classes has been made only for small 
values of n (forn S 16, see M. KNESER, Archiv der Math., 8, 1957, pp. 241-250). 
One can get this from the Minkowski-Siegel formula (Kneser uses a different 
method). I will just state this formula (I restrict myself, for the sake of 
simplicity, to type II[—there are analogous results for type I): 

Let n = 8k be an integer divisible by 8. Let C, denote the set of iso- 
morphism classes of elements E ¢ S, which are positive definite of type II. 
If E € C,, let Gp be the group of automorphisms of E; it is a finite group since 
it is a discrete subgroup of the orthogonal group, which is compact; let g; 
be the order of Gy. Put: 

M, = y; 1/2. 
EeC,, 
This is the ‘‘mass” of C,, in Eisenstein’s sense, i.e. the number of elements 
E of C,, counted each with the multiplicity 1/g-. The Minkowski-Siegel 
formula‘) gives the value of M,: 


“) For a proof of this formula, cf. C. L. SIEGEL, Gesamm. Abh., 1, n° 20 and III, n° 79, 
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where n = 8k, and the B, are the Bernoulli numbers (8, = 


cf. chap. VII, n° 4.1). 


(Here are some approximate values of the M,: 
M,=107? x 1.4352. ..; Myg=107 '® x 2.4885... ; My,4=107 '* x 7.9367. .., 
M3, = 107 x 4.0309... ; My = 10°' x 4.3930...) 

This formula gives a method to prove that a subset C’ of C, is equal 
to C,; it suffices to check that the sum of the I/g,, for E¢C’, is equal to 
M, (if C’ + C,, this sum is < M,). 


Examples 

i) n = 8, i.e. k = 1. One can show (see for instance BOURBAKI, Gr. et Alzg. 
de Lie, chap. VI, §4, n° 10) that the order of the group of automorphisms of 
I’, is 2'*35527, Moreover, formula (*) gives M, = 27~'* 37° 57? 771. By 
comparing, we see that C, is reduced to the single element I',, a result due to 
Mordell. 


ii) n = 16. We know two elements of C,,: [',, and [, @ I's. One can 
prove that the corresponding orders gz are respectively 2'°(16!) and 
2793105472, Moreover M,, = 691.27 3°3~ '195~*7- 71171137! and it is easy 
to check that 

691 /239319547211.13 = 1/2!5(16!) + 1/2293'95*7?, 
We thus have C,, = {I'y., ls @ I's}, a result due to Witt. 
ii) m= 24. The determination of C,, has been made in 1968 by H. 
Niemeier‘; this set has 24 elements. One of them (discovered by Leech a 
propos of the sphere-packing problem in R™) is particularly remarkable; it 


is the only element of C,, which contains no vector x with x.x = 2. Its group 
of automorphisms G has order: 


2773°547711.13.23 = 8 315 553 613 086 720 000. 


The quotient G/{ + 1) is the new simple group discovered by Conway”. 


iv) n = 32. Since M3, > 4.10’ and g, = 2 for all E, we see that C;, has 
more than 80 million elements; they have not been classified yet. 


§3. Proofs 
3.1. Proof of theorem 3 


Let EeS, and let V= E@Q the corresponding Q-vector space. 
Suppose E is indefinite. We must show that E (or V) represents zero. We 
consider several cases: 


|) See H. NIEMEIER, J. Number Theory, 5, 1973, pp. 142-178. 


(2) See J. H. Conway, Proc. Nat. Acad. Sci. USA, 61, 1968, pp. 398-400, and Invent. 
Math., 7, 1969, pp. 137-142. 
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i) n = 2. The signature of V is then (1, 1), hence dE) = —1. Since —d(£) 
is a square in Q, it is clear that V represents 0. 


ui) n= 3. Let f(X,, X2, X3) = 2a,;X;X; be the corresponding quadratic 
form with respect to a basis of E; one has a,,;¢«Z and det(a;,;) = +1. 
If pis a prime number #2, the form deduced from f by reduction modulo p 
has a nontrivial zero (chap. I, n° 2.2), and this zero can be lifted to a p-adic 
zero (chap. II, n° 2.2), and cor. 2 to th. 1). Hence f represents 0 in all the 
Q,(p + 2), and also in R; by cor. 3 of th. 8 of chap IV, n° 3.2, this implies 
that f represents 0 in Q. 


iii) n = 4. The same argument as above shows that the quadratic form / 
represents 0 in all the Q,, p + 2, and in R. If the discriminant d(E) of f is 
equal to I, this suffices to imply that f represents 0 in Q (cor. 3 to th. 8 of 
chap. IV, n° 3.2). Otherwise one has d(£) = —1 and d(E) is not a square 
in Q,; by th. 6 of n° 2.2. of chap. IV, this implies that f represents 0 in Q,; 
the Hasse-Minkowski’s theorem (chap. IV, n° 3.2, th. 8) then shows that / 
represents 0 in Q. 


iv) n = 5. One applies Meyer’s theorem (chap. IV, n° 3.2, cor. 2 to th. 8). 


3.2. Lemmas 


Let E « S and let F be a submodule of E; let F’ be the set of elements of 
E orthogonal to all elements of F. 


Lemma 1.—Jn order that F, with the form x.y induced from that of E, 
be in S it is necessary and sufficient that E be the direct sum of F and F’. 

If E = F @ F’, then one has d(£) = d(F).d(F’) from which d(F’) = +1. 
Conversely if d(F) = +1, one has clearly FO F’ = 0; moreover, if x ¢ E, 
the linear form y+ x.y(y € F) is defined by an element x, ¢ F. We then 
have x = X9+x,, with x» ¢ F and x, ¢ F’, hence E = F@ F’. 


Lemma 2.—Let x « E be such that x.x = +1 and let X be the orthogonal 
complement of x in E. If D = Zx, one has E= D® X. 

One applies lemma | to F = D. (If, for instance, x.x = 1, one has D ~ /, 
hence E~ 1, @ X.) 

An element xe E is called indivisible if it is not contained in any sub- 
group nE (n 2 2), 1.e., if one cannot divide it by any integer 22. Every non- 
zero element of E can be written in a unique way in the form mx with m 2 1 
and x indivisible. 


Lemma 3.—/f x is an indivisible element of E there exists y « E such that 
xy = 1. 

Let f, be the linear form yr> x.y defined by x. It is a homomorphism 
E - Z. Moreover, f, is indivisible since x is and since x.y defines an iso- 
morphism of £ onto its dual Hom (E, Z). From this follows that /, is surjec- 
tive (otherwise, one could divide it by an integer 22) and there exists thus 
y €E such that x.y = 1. 
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3.3. Structure theorem (odd indefinite case”) 


Lemma 4.—Let E « S,. Suppose E is indefinite and of type |. There exists 
FeS,_, such that E~1, @1_ @ F. 

By theorem 3, there exists x ¢ FE, x # 0 such that x.x = 0. Being free to 
divide x by an integer, we can suppose x is indivisible; by lemma 3 above, 
there exists thus y e E such that x.y = 1. One can choose y of such that y.y 
is odd. Indeed, suppose that y.y is even; since E is of type I, there exists te E 
such that t.t is odd. Put y’ = t+ky and choose k such that x.y’= 1, Le. 
k = 1—x.t; one has y’.y’ = t.t (mod 2) and y’.y’ is odd. We can thus suppose 
that y.py = 2m4+1. Put then e, = y—mx, e, = y—(m+1)x. One checks 
immediately that e,.e, = 1, e,.e. = 0, e,.e, = —1. The submodule G of E 
generated by (e,, e,) is isomorphic to J, © J_; by lemma 1, we have thus 
E~l, @/_ @F, with Fe S,_,. 


Proof of theorem 4.--We use induction on n. Let E € S, with E indefinite 
and of type I. By lemma 4, Ex~ /, @/. @F. If n = 2, we have F = 0 
and the theorem is proved. If 2 > 2, we have F 4 0 and one of the modules 
I, © F,I_ @ F, is indefinite, for instance the first one. Since /, 1s of type I, 
the same is true for /, © F and the inductive hypothesis shows that J, © F 
is of the form al, @® bI_; this shows that E ~ al, ® (O+1)/_. 


3.4. Determination of the group K(S) 


Let Ee S, E40. Then E@/, or E@/_ is indefinite and of type I. 
Applying theorem 4, we see that the image of £ in K(S) is a linear combina- 
tion of (/,) and of (/_). This implies that (/,) and (/_) generate K(S). Since 
their images by the homomorphism 


(r, 7): K(S) ~ ZxZ 
are linearly independent, (/,) and (J_) form a basis of K(S). 


3.5. Structure theorem (even indefinite case) 


Lemma 5.—Let E ¢ S. Suppose E is indefinite and of type \l. There exists 
FeS such that E~ U®F. 

We proceed as in the proof of lemma 4. Choose first xe EF, x + 0, x 
indivisible such that x.x = 0; choose next ye EF such that xy = 1. If 
y.y = 2m, we replace y by y—mx and obtain a new y such that y.y = 0. 
The submodule G of E generated by (x, y) is then isomorphic to U; by lemma 
l one has E~ U@ F with FeS. 


Lemma 6.—Let F,, F, ¢ S. Suppose that F, and F, are of type Il and that 
I, @I_@F, ~/l. OI. © F,. Then U@F, =~ USF). 


“) The method followed in this section has been shown to me by Milnor, together 
with the idea of introducing the group K(S). 
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To simplify the notations, we put W=/, @/_, E; = W@F,, and 
V, = E;®Q. In E,, let E° be the subgroup of elements x such that x.x = 0 
(mod 2); it is of index 2 in E;. One sees immediately that E2 = W° @ F, where 
W° is the set of elements x = (x,, x,) of W such that x, = x, (mod 2). 
Let Et be the ‘‘dual” of E? in V,, i.e. the set of y ¢ V; such that x.y « Z for 
all x ¢ E?. It is clear that E> = W* © F; where W* is the set of (x,, x2) 
such that 2x, « Z, 2x, «Z, x,;—x,¢Z. One has E° Cc E, < E* and the 
quotient E,*/E? is isomorphic to W*/W,; it is a group of type (2, 2). There 
exist thus three subgroups strictly between E{ and £;*; they correspond to 
the three subgroups of order 2 in a group of type (2, 2). One of them is £; 
itself; the two others will be denoted by E; and £,’. Here again we have: 


Ej =W'@F, and E’=W’' OF, 


where W’ and W” are defined in an obvious way. One checks that W’ and 
W" are isomorphic to U (for instance, take for basis of W’ the vectors 
a = (4 4), 6=(I, —1); one has aa = b.b = 0, a.b = 1; for W’, take 
(4, —4) and (/, 1)). Let then f: W @ F, ~ W@ F, be an isomorphism. It 
extends to an isomorphism of V, onto V,, which carries E, onto E,, thus 
also E? onto E? and E,* onto E,*. Thus it carries also (Ej, Ej) onto either 
(E;, £3) or (£3, E;). Since Ej and E; are isomorphic to U @ F,, one sees 
that U@F, ~ U@ Fy. 


Proof of theorem 5.—We first prove that if E,, E, €« S are indefinite of 
type II and have the same rank and same index, they are isomorphic. By lemma 
5, one has FE; = U@F,, F, = U@® F;; it is clear that F, and F, are of 
type II and same rank and same index. The modules /, @ /_ @ F, and 
I, ®/. ® F, are indefinite, of type I, of same rank and index. By theorem 
4, they are isomorphic. Applying lemma 6, we see then that £, and E, are 
isomorphic, which proves our assertion. 

Theorem 5 is now clear: if E is indefinite, of type I], and if 7(£) 2 0 we 
determine integers p and g by the formulas 


q=37(E), p= HE)—7(E)). 


By applying the above result to the modules E and pU @ qI', one sees that 
these modules are isomorphic. 


Part Il 


Analytic Methods 


Chapter VI 


The Theorem on Arithmetic Progressions 


The aim of this chapter is to prove the following theorem, conjectured 
(and used) by Legendre and proved by Dirichlet. 


Theorem.—Let a and m be relatively prime integers 21. There exist 
infinitely many prime numbers p such that p = a (mod m). 

The method we follow (which is that of Dirichlet himself) uses the 
properties of the L-functions. 


§1. Characters of finite abelian groups 


l.l. Duality 


Let G be a finite abelian group written multiplicatively. 


Definition 1.—A character of G is a homomorphism of G into the multi- 
plicative group C* of complex numbers. 

The characters of G form a group Hom (G, C*) which we denote by G; 
it 1s called the dual of G. 


Example. Suppose that G is cyclic of order n with generator s. If 
x:G —C* is a character of G, the element w = ,(s) satisfies the relation 
w" = 1, Le. is a mth root of unity. Conversely every nth root of unity w 
defines a character of G by means of s*++ w*. Thus we see that the map 
x ++ (s) is an isomorphism of G on the group p, of nth roots of unity. In 
particular, G is cyclic of order n. 


Proposition 1.—Let H be a subgroup of G. Every character of H extends 
to a character of G. 

We use induction on the index (G:H) of H in G. If (G:H) = 1, then 
H = G and there is nothing to prove. Otherwise let x be an element of G 
not contained in H, and let n be the smallest integer >1 such that x" ¢ H. 
Let x be a character of H, and let ¢ = x(x"). Since C* is a divisible group, 
one can choose an element w ¢ C* such that w” = ¢. Let H’ be the subgroup 
of G generated by H and x; every element A’ of H’ can be written h’ = hx* 
with ae Z and he H. Put 


x'(h’) = x(A)w’. 
One checks that this number does not depend on the decomposition Ax* of 
h’ and that y’: H’ —+(C* is a character of H’ extending y. Since we have 


(G: H’) <(G:H) the inductive hypothesis allows us to extend x’ to a 
character of G. 
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Remark. The operation of restriction defines 2 homomorphism 
p:G—>A 
and prop. 1 says that p is surjective. Moreover, the kernel of p is the set of 
characters of G which are trivial on H; it is thus isomorphic to the group 
(G/H) dual to G/H. Hence the exact sequence: 


{1} > (G/B) > 6 +A > {1}. 


Proposition 2.—The group G is a finite abelian group of the same order as G. 

One uses induction on the order n of G, the case n = | being trivial. 
If n 2 2, choose a nontrivial cyclic subgroup H of G. By the remark above, 
the order of G is the product of the orders of A and of (G/B). But the 
order of H (resp. of G/H) is equal to that of its dual, because H is cyclic 
(resp. because G/H is of order strictly smaller than 1). We conclude from 
this that the order of G is the product of the orders of H and of G/H, hence 
is equal to the order of G. 


Remark. One can prove a more precise result: G is isomorphic (non- 
canonically in general) to G. This is shown by decomposing G into a product 
of cyclic groups. 

If x eG the function y+ (x) is a character of G. We obtain thus a 


homomorphism ¢«: G > G. 


Proposition 3.—The homomorphism « is an isomorphism of G onto its 


bidual G. a 

Since G and G have the same order, it suffices to prove that « is injective, 
i.e. that, if xeG is #1, there exists a character y of G such that x(x) + I. 
Now, let H be the cyclic subgroup of G generated by x. It is clear (see the 
above example) that there exists a character x of H such that x(x) + | and 
prop. | shows that y extends to a character of G; hence the desired result. 


1.2. Orthogonality relations 
Proposition 4.— Let n = Card(G) and let x € G. Then 


_ {a ify =1 
Ze) =| ifx +1. 


The first formula is obvious. To prove the second, choose y eG such 
that x(y) + 1. One has: 


xy) Y x) = » xy) = & x0), 
xeG xeG xéG 


hence: 


(XxQ)— 1) X x(x) = 0. 


Since x(y) + 1, this implies 2 x(x) = 0. 
xéG 
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Corollary.—Let x ¢ G. Then 


_ {nf x=l 
Z x0) = {5 if xt. 


This follows from prop. 4 applied to the dual group G. 


Remark.—The above results are special cases of the “orthogonality 
relations” of the character theory of finite groups (not necessarily abelian). 


1.3. Modular characters 


Let m be an integer 21. We denote by G(m) the multiplicative group 
(Z/mZ)* of invertible elements of the ring Z/mZ. It is an abelian group of 
order ¢(m), where ¢(m) is the Euler ¢-function of m, cf. chap. I, n° 1.2. An 
element y of the dual of G(m) is called a character modulo m; it can be 
viewed as a function, defined on the set of integers prime to m, with values 
in C*, and such that y(ab) = y(a)x(b); it is convenient to extend such a 
function to all of Z by putting x(a) = 0 if a is not prime to m. 


Examples 
1) m = 4; the group G(4) has two elements, hence has a unique nontrivial 
character, which is x ++ (—1)*, cf. chap. I, n° 3.2. 
2) m= 8; the group G(8) has four elements. It has three nontrivial 
characters which are 

XK (— 1), (— 1°), (— LP Fe@) 


cf. chap. I, n° 3.2. 
3) m = p with p prime +2. The group G(p) is cyclic of order p—1, hence 


has a unique character of order 2, the Legendre character x +> (¢ 
P 
4) m = 7. The group G(7) is cyclic of order 6, thus has two characters of 


order 3 which are complex conjugates. One of them is given by 
x(x) = 1 ifx = +1 (mod 7) 
x(x) = e793 if x = +2 (mod 7) 
x(x) = e4**"/? if x = +3 (mod 7). 


The characters of order 2 are closely related to the Legendre characters. 
More precisely: 


Proposition 5.—Let a be a non-zero square-free integer (cf. chap. IV, n° 3.2) 
and let m = 4ja|. Then there exists a unique character x, modulo m such that 


XP) = (*) for all prime numbers p not dividing m. One has x2 = 1 and 
P 
x. £ lifa +1. 


The uniqueness of x, is clear because all integers prime to m are products 
of prime numbers not dividing m; the same argument shows that x? = 1. 
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To prove the existence of y,, assume first that a = /,...4, where the /; 
are distinct prime numbers, different from 2. Then we take for y, the character 


— (—y)\elxe(a) [ * x 
Xx) = (—1) ()--(): 


If p is a prime number distinct from 2 and the /;, the quadratic reciprocity 


law shows that / / 2 
wv-)--G) 
p Pp P 


and y, has the required property. We have y, + | if a@ + 1; indeed, if we 
choose x such that 


(7)= —| and x=1 (mod 4/,... /,), 


we have x,(x) = —I. 

When a is of the form —6 (or 2b or —2b) with b = /,.../, as above, 
we take for x, the product of y, with the character (—1)* (or (—1)?™ 
or (—1)**)+°)), A similar argument shows that yx, + 1. 

Remark.—One can prove that, if x is an integer >0 prime to m, then 

x( =[] ax = [] @~, 
i|m (lym) =1 
where (a, x), denotes the Hilbert symbol of a and x in the field Q,. This 
formula could have been used to define x,. 


§2. Dirichlet series 


2.1. Lemmas 


Lemma 1.—Let U be an open subset of C and let f, be a sequence of 
holomorphic functions on U which converges uniformly on every compact set 
to a function f. Then f is holomorphic in U and the derivatives f,' of the f, 
converge uniformly on all compact subsets to the derivative f' of f. 

Let us recall briefly the proof: 

Let D be a closed disc contained in U and let C be its boundary oriented 
in the usual manner. By Cauchy formula, one has 


feo) = | LO. ae 


Zin | Z—Zg 
Cc 


for all z,) interior to D. Passing to the limit, one gets: 


feo) = 50 | FO ae 


2in Z—Zo 


* 


which shows that fis holomorphic in the interior of D, and the first part of 
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the lemma follows. The second part is proved in the same way, using the 


formula: 
f'(@) = =— | IO) a 


2in | (z—2Z0)* 
¢ 


Lemma 2 (Abel’s lemma).—Ler (a,) and (b,) be two sequences. Put: 


n=p a=im’ 
Amp = ¥, 4 and Sym = Y aqby. 
=m n=m 
Then one has: 
n=m'— 1 
Sint, m’ = y Am, nOn— Ones) + Am, Om 
n=m 


One replaces a, by Am.n—Am.,-1 and regroups the terms. 


Lemma 3.—Let a, B be two real numbers with 0 < « < B. Let z = x+iy 
with x, ye R and x > 0. Then: 


— 


le? ef < Zz (e~ ** —e *), 


One writes 


hence by taking absolute values: 


B 
le~*7 -e ¥7| < |2| fevva = | (e~** —e7 **), 
x 
a 


2.2. Dirichlet series 


Let (A,) be an increasing sequence of real numbers tending to +00. For 
the sake of simplicity, we suppose that the A, are >0 (this is not essential, 
for we can always reduce ourselves to this case by suppressing a finite number 
of terms of the series under consideration). 

A Dirichlet series with exponents (A,) is a series of the form 


ae“ (a,¢C,zeC). 


Examples 

(a) 4, = log n (ordinary Dirichlet series); such a series is written 2 a,/n’, 
cf. n° 2.4. 

(b) A, = n. By setting t = e~7, the series becomes a power series in . 


Remark.—The notion of Dirichlet series is a special case of that of the 
Laplace transform of a measure yp. This is the function 


few u(t). 
Q 
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The case considered here is that where uz is a discrete measure. (For more 
details, see for instance D. WippeR, The Laplace Transform, Princeton Univ. 
Press, 1946.) 


Proposition 6.—Jf the series f(z) = Xa,e~*"? converges for z = Zo, it 
converges uniformly in every domain of the form R(z—Z,.) 2 0, Arg(z—Zp) S « 
with a < m/2. | 

(Here, and in all that follows, R(z) denotes the real part of the complex 
number z.) 

After making a translation on z, we can suppose that z) = 0. The 
hypothesis then means that the series Za, is convergent. We must prove that 
there is uniform convergence in every domain of the form R(z) 2 0, |z|/R(z) 
< k. Let e > 0. Since the series 2a, converges, there is an N such that if 
m,m' 2 N, we have |A,, _’| Se (notations being those of lemma 2). Applying 
this lemma with b, = e~*"*, we obtain 


m’= | 


Smom = > Am ler ett FAR en, 
m 


By putting z = x+iy and applying lemma 3, we find: 


m’—1 
LSinm’ | = (1 + 2 > (erie etn), 
x 


that is to say: 
[Sinme| Se (L+k(e7 4m — e4m'*)) 
hence: 
|Snm'| & e(1 +k), 


and the uniform convergence Is clear. 


Corollary 1.—/f f converges for z = Zo, it converges for R(z) > R(zo) and 
the function thus defined is holomorphic. 
This follows from prop. 6 and lemma 1. 


Corollary 2.—The set of convergence of the series f contains a maximal 
open half plane (called the half plane of convergence). 

(By abuse of language we consider 2 and C as open half planes.) 

If the half plane of convergence is given by R(z) > p, we say that p is 
the abscissa of convergence of the series considered. 

(The cases 2 and C correspond respectively to p = +0 and p = — oo). 

The half plane of convergence of the series Lla,|e~4"? is called (for 
obvious reasons) the half plane of absolute convergence of f; its abscissa of 
convergence is denoted by p*. When A, = n (power series), it is well known 
that p = p". This is not true in general. For example the simplest Z series: 


L(z) = 1-1/3? +1/5’-1/77 +... 


corresponds to p = 0 and p* = I, as we will see later. 
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Corollary 3.—/(z) converges to f(Z,) when Z —> Zo in the domain 
R(z—2)) 2 0, |Arg (z—Zo)| S & with « <7n/2. 


This follows from the uniform convergence and the fact that e~ +"? tends 
to e7 470, 


Corollary 4.—The function f(z) can be identically zero only if all its 
coefficients a, are zero. 

Let us show a, is zero. We multiply f by eo? and make z tend to + 
(with z real for instance). The uniform convergence shows that e*o7f tends 
then to a, hence ay = 0. We proceed similarly for a,, etc. 


2.3. Dirichlet series with positive coefficients 


Proposition 7.—Let f = La,e~*"" be a Dirichlet series whose coefficients 
a, are real 20. Suppose that f converges for R(z) > p, with péR, and that 
the function f can be extended analytically to a function holomorphic in a 
neighborhood of the point z = p. Then there exists a number e > 0 such that f 
converges for R(z) > p—e. 

(In other terms, the domain of convergence of fis bounded by a singularity 
of f located on the real axis.) 

After replacing z by z—p, we can assume that p = 0. Since / is holo- 
morphic for R(z) > 0 and in a neighborhood of 0, it is holomorphic in a 
disc |z—1| S 1+e, with e > 0. In particular, its Taylor series converges in 
this disc. By lemma 1, the pth derivative of fis given by the formula 


f(z) = ¥.a,(—A,)’e7*"* for R(z) > 0; 


hence 


FN) = (= 1) Y Mage 


The Taylor series in question can be written 


fey = ¥ -@-nef), |z-1] S +e 
p=0 p! 
In particular for z = —e, one has 
fl) = Y= 1°11), 
p=0 p! 


the series being convergent. 
But (—1)?f(1) = = A?a,e~*" is a convergent series with positive terms. 


n 


Hence the double series with positive terms 


| P\P27 An 
(9) = Lane (+e) Ale A 
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converges. Rearranging terms, one gets 


oO 


f(-«) = y a,e~?" Y i (1+e)?A? 
n p=0 p! 


_ y ane” inedn(! 8) _ Ya,e""", 
n n 


which shows that the given Dirichlet series converges for z = —e, thus also 
for R(z) > —e. 


2.4. Ordinary Dirichlet series 


This is the case A, = log n. The corresponding series is written 
f(s) = 2. a,|n’, 


the notation s being traditional for the variable. 


Proposition 8.—/f the a, are bounded, there is absolute convergence for 
R(s) > 1. 


This follows from the well known convergence of a I/n* fora > I. 


Pp 
Proposition 9.—I/f the partial sums A,,,) = % a, are bounded, there is 


convergence (not necessarily absolute) for R(s) > 9. 
Assume that |A,,,| S$ K. By applying Abel’s lemma (lemma 2), one 
finds 


m’—} 


[Sin m'| S K( » 


We can suppose that s is real (by prop. 6). This allows us to write the preced- 
ing inequality in the simpler form 


[Sin m’| Ss K/m*, 


~*) 


nn (n+ 1)° 


l l 
m’s 


and the convergence is clear. 


§3. Zeta function and L functions 
3.1. Eulerian products 


Definition 2.—A function f: N — C is called multiplicative if f(1) = 1 and 
f(mn) = f(m)f(n) 


whenever the integers n and m are relatively prime. 
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Examples.—The Euler ¢-function (chap. I, n° 1.2) and the Ramanujan 
function (chap. VII, n° 4.5) are multiplicative. 


Let f be a bounded multiplicative function. 


Lemma 4.—The Dirichlet series a f(n)/n* converges absolutely for 


R(s) > 1 and its sum in this domain is equal to the convergent infinite product 


II (l+f(p)p s+... +f(p")p ™+...). 
pe 


(Here and in the following, P denotes the set of prime numbers.) 

The absolute convergence of the series follows from the fact that f ts 
bounded (prop. 8). Let S be a finite set of prime numbers and let N(S) be 
the set of integers 21 all of whose prime factors belong to S. 

The following equality is immediate: 


1EN(S) pes 


| Y fn =I] ( y foe 


When S increases, the left hand side tends to zu f(n)/n’. From this, one sees 
a= 


that the infinite product converges and that its value is equal to Zf(n)/n’. 


Lemma §.—/f f is multiplicative in the strict sense (i.e. if f(nn’) = f(n)f(n') 
for all pairs n, n’' €eN), one has: 


oo s_ l 
2 for = Woe 


This follows from the above lemma and the identity f(p”) = f(p)™. 
3.2. The zeta function 


Apply the preceding section with f = 1. We obtain the function 


= | ! 
ws) = ¥ 4 = T]—. 
n=1 NM peP 1 — I 
p* 
these formulas making sense for R(s) > 1. 
Proposition 10.—(a) The zeta function is holomorphic and + Q in the half 
plane R(s) > 1. 
(bh) One has: 
l] 
f(s) = —— + (3), 
s—| 


where $(s) is holomorphic for R(s) > 0. 
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Assertion (a) is clear. For (b), we remark that 


oo n+l 

t= [eva = ) | t dt. 

s—! n=1 
| n 

Hence we can write 
n+] x0 n+ 
U(s) = Jt. +) te t ‘dt) = —_ +) (n° —t-*)dt. 
s—l n=1\n° s—] n=1 


Put now: 


Ho) = | Ort—ryd and $9) = ¥ 4,6) 


n 


We have to show that ¢(s) is defined and holomorphic for R(s) > 0. But it is 

clear that each ¢,(s) has those properties, thus it suffices to prove that the 

series Xd, converges normally on all compact sets for R(s) > 0. One has: 
,(s)| S$ sup |n-*—1-*|. 


nStSant+l 


But the derivative of the function n~*—t7* is equal to s/t'*!. From this we 
get: 


+1 


i¢,(s)| < is . with x = R(s), 
Hn 


and the series &¢, converges normally for R(s) 2 e, for all e > 0. 


Corollary 1.—The zeta function has a simple pole for s = 1. 
This is clear. 


Corollary 2.—When s —> 1, one has x p *~ log 1/(s—1), and | 2 _ lp™ 
remains bounded. 
One has: 


log &(s) = d 


peP,k Ge ks 


with d(s) = oa 2, Uke"). The series ¢ is majorized by the series 


= 2, 1/p°+4(s), 


Dlip = Llipo'-V $ Lup(e=1) $ Yo Ann=1) = 1. 


This implies that % 1s bounded, and since cor. 1 shows that log f(s) ~ 


Remark.—Even though it is not necessary for our purpose, it should be 
mentioned that ¢(s) can be extended to a meromorphic function on C with 
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the single pole s = 1. The function &(s) = m7 ‘/?I'(s/2)&(s) is meromorphic 
and satisfies the functional equation &(s) = &(1—s). 
Moreover, the zeta function takes rational values on the negative 
integers: 
{(—2n) = 0 ifn > 0 
C((1—2n) = (—1)"B,/2n ifn > 0, 


where 8B, denotes the mth Bernoulli number (cf. Chap. VII, n° 4.1). 

One conjectures (Riemann hypothesis) that the other zeros of ¢ are on 
the line R(s) = 4. This has been verified numerically for a large number of 
them (more than three million). 


3.3. The L-functions 


Let m be an integer 21 and let y be a character mod m (cf. n° 1.3). The 
corresponding L function is defined by the Dirichlet series 


L(s, x) = y (nn 


Note that, in this sum, it is only the integers n which are prime to m which 
give a non-zero contribution. 
The case of the unit character gives nothing essentially new: 


Proposition 11.—For x = 1, one has 
L(s, 1) = F(s)&(s)_ with F(s) = [] (l1—p~*). 
p|m 


In particular L(s, 1) extends analytically for R(s) > 0 and has a simple pole 
ats = 1. 
This is clear. 


Proposition 12.—For x + 1 the series L(s, x) converges (resp. converges 
absolutely) in the half plane R(s) > 0 (resp. R(s) > 1); one has 


L(s, x) = I] 


per) _ XMP, 
P 


The assertions relative to R(s) > 1 follow from what has been said in 
n° 3.1. It remains to show the convergence of the series for R(s) > 0. Using 
prop. 9, it suffices to see that the sums 


for R(s) > 1. 


Ay» = y x(n), uv, 


are bounded. Now, by prop. 4, we have 


ut+m-l 


yy x(n) = 0. 
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Hence it suffices to majorize the sums A, , for v—u < m, and this is obvious: 


one has 
[Au,ul S (mm). 


The proposition follows. 


Remark.—In particular L(|, x) is finite when y + 1. The essential point 
of Dirichlet’s proof consists in showing that L(1, x) ts different from zero. 
This is the object of the next section. 


3.4. Product of the L functions relative to the same integer m 


In this section, m is a fixed integer 2 |. If p does not divide m, we denote 
by p its image in G(m) = (Z/mZ)* and by /(p) the order of p in the group 
G(m). By definition, f(p) is the smallest integer f > 1 such that p/ = | 
(mod m). We put 


8(p) = o(m)/f(p); 
This is the order of the quotient of G(m) by the subgroup (/) generated by p. 


Lemma 6.—/f ptm, one has the identity 
[Td -x(p)7) = 1-1/0"), 
where the product extends over all characters x of G(m). 


Let W be the set of f(p)-th roots of unity. One has the identity 
[] G-wT7) = 1-77. 


wewW 


Lemma 6 follows from this and the fact that for all w ¢ W there exists g(p) 
characters y of G(m) such that x(p) = w. 


We now define a new function ¢,,(s) by means of the formula 


CS) = [| L(s, x), 
Xx 
the product being extended over all characters x of G(r). 


Proposition 13.— One has 
] 
Ca(s) = |] 


(i l yr ° 
pis 


This is a Dirichlet series, with positive integral coefficients, converging in the 
half plane R(s) > 1. 

Replacing each L function by its product expansion, and applying 
lemma 6 (with T = p’‘), we obtain the product expansion of ¢,,(s). This 
expansion shows clearly that it is a series with positive coefficients; its con- 
vergence for R(s) > 1 1s clear. 
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Theorem 1.—(a) ¢,, has a simple pole at s = }. 
(b) LU, x) #0 for all x + 1. 

If L(1, x) + 0 for all y + 1, the fact that L(s, 1) has a simple pole at 
s = 1 shows that the same is true for ,,. Thus (b) = (a). Suppose now that 
LA, x) = 0 for some x + 1. Then the function ¢,, would be holomorphic 
at s = 1, thus also for all s such that R(s) > 0 (cf. prop. 11 and 12). Since 
it is a Dirichlet series with positive coefficients, this series would converge 
for all s in the same domain (prop. 7). But this is absurd. Indeed, the pth-factor 
of f,, is equal to 

I = — f(P)S 4 py ~2F(p)s (p) 
(ap7 foie ~ +P P +p P +... .)8??, 
and dominates the series 
1p Hs 4 yp 2005 4 
It follows that ¢,, has all its coefficients greater than those of the series 
n em)s 


(n, m)=1 


which diverges for s = — . This concludes the proof. 
¢(m) 

Remark.—The function ¢,, is equal (up to a finite number of factors) 
to the zeta function associated with the field of mth roots of unity. The fact 
that ¢,, has a simple pole at s = 1 can also be deduced from general results on 
zeta functions of algebraic number fields. 


§4. Density and Dirichlet theorem 
4.1. Density 


Let P be the set of prime numbers. We have seen (cor. 2 to prop. 10) 
that, when s tends to | (s being real >1 to fix the ideas) one has 


l ] 
— ~ log ——. 
De BI 


Let A be a subset of P. One says that A has for density a real number k 


when the ratio 
( =) (10 : 
peA Pp s—] 


tends to k when s — |. (Of course, one then has 0 < k $1.) The theorem 
on arithmetic progressions can be refined in the following way: 


Theorem 2,—Let m 2 1 and let a be such that (a, m) = 1. Let P, be the 
set of prime numbers such that p = a(mod m). The set P,, has density 1/4(m). 

(In other words the prime numbers are “equally distributed” between the 
different classes modulo m which are prime to m.) 
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Corollary.—The set P, is infinite. 
Indeed a finite set has density zero. 


4.2. Lemmas 


Let x be a character of G(m). Put 
ffs) = 2 x(p)/P*; 
pim 


this series being convergent for s > 1. 


Lemma 7.—/f x = 1, then f, ~ log fer s—> 1. 
S— 
Indeed, /, differs from the series 21/p* by a finite number of terms only. 


Lemma 8.—/f xy + 1, f, remains bounded when s > 1. 

We use the logarithm of the function L(s, x). It is necessary to make a 
little more precise what we mean by this (due to the fact that “log” is not 
properly speaking a function): 

L(s, x) is defined by the product [[1/(1 —x(p)p’). For R(s) > 1 each factor 


is of the form 1/(1 —«) with |«| < 1. We define log — as or a"/n (““principal”’ 
— O a= 


determination of the logarithm) and we define log L(s, yx) by the series (clearly 
convergent): 


log L(s, x) = ¥ log —_—— (R(s) > 1) 
1—x(p)p 
=) x(p)"/np™ 


(Equivalent definition: take the “‘branch”’ of log L(s, x) in R(s) > 1 which 
becomes 0 when s — +00 on the real axis.) 
We now split log L(s, x) into two parts: 


log L(s, x) = f,(s) + F,) 


F(s) = De Mey ine”. 


with 


Theorem |, together with cor. 2 of prop. 10, shows that log L(s, x) and 
F(s) remain bounded when s — 1. Hence the same holds for f,(s), which 
proves the lemma. 


4.3. Proof of theorem 2 


We have to study the behavior of the function 


g.(s) = ), lip’ 


pePa 
for s— l. 
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Lemma 9.—One has 


84(5) = som rae x(a)" f,6); 


the sum being extended over all characters x of G(m). 
The function Zy(a)~ '/,(s) can be written, by replacing f, by its definition: 


Py (2 x(a~ x(P))/P* 
But x(a~')y(p) = x(a7 *p). By the corollary to prop. 4, we have: 
x(a“ "p) = d6(m) if a~'p = 1 (mod m) 
; = 0 otherwise. 
Hence we find the function ¢(m)g,(s). 


Theorem 2 is now clear. Indeed, lemma 7 shows that f(s) ~ log — 
§s— 


for x = 1, and lemma 8 shows that all other f, remain bounded. Using 


lemma 9, we then see that g,(s) ~ Sm log — , and this means that the 
S-— 
density of P, is —— , q.e.d. 
¢(m) 


4.4. An application 


Proposition 14.—Let a be an integer which is not a square. The set of 


prime numbers p such that (*) = | has density 4. 
Pp 


We can assume that a is square-free. Let m = 4la], let y, be the character 
(mod m) defined in prop. 5 of n° 1.3 and let H < G(m) be the kernel of x, 
in G(m). If p is a prime number not divisible by m, let p be its image in G(m). 


We have ( *) = | if and only if p is contained in H. By th. 2 the set of prime 
P 
numbers verifying this condition has for density the inverse of the index of H 


in G(m), that is to say 4. 


Corollary.—Let a be an integer. If the equation X?—a = 0 has a solution 
modulo p for almost all pe P, it has a solution in Z. 


Remark.—There are analogous results for other types of equations. For 
instance: 
i) let f(x) = agX"+...+a, be a polynomial of degree n with integer co- 
efficients, which is irreducible over Q. Let K be the field generated by the 
roots of f (in an algebraically closed extension of Q) and let N = [K: Q]. 
One has N 2 n. Let P, be the set of prime numbers p such that f ‘decomposes 
completely modulo p”,, i.e. such that all the roots of f (mod p) belong to F.. 


One can prove that P, has density ~ . (The method is analogous to that of 
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the Dirichlet theorem—one uses the fact that the zeta function of the field 
K has a simple pole at s = 1.) One can also give the density of the set P; of 
p such that the reduction of / (mod p) has at least one root in F,; it is a 
number of the form g/N with 1 S$ gq < N (setting aside the trivial case 
where n = 1). 

ii) More generally, let {f,(x,,...,-,)} be a family of polynomials with 
integer coefficients and let Q te the set of p « P such that the reductions of 
f, (mod p) have a common zero in (F,)". It can be proved (see J. Ax, Ann. of 
Maths., 85, 1967, pp. 161-183) that QO has a density; moreover this density 
is a rational number and is zero only if Q is finite. 


4.5. Natural density 


The density used in this paragraph 1s the ‘‘analytic density” (or ‘Dirichlet 
density’). Despite its apparent complexity, it is very convenient. 

There is another notion, that of “‘natural density”: a subset A of P has 
natural density k if the ratio 


number of elements of A which are Sn 
number of elements of P which are Sn 


tends to k when n— o. 

One can prove that, if A has natural density k, the analytic density of A 
exists and is equal to k. On the other hand, there exist sets having an analytic 
density but no natural density. It is the case, for example, of the set P' of 
prime numbers whose first digit (in the decimal system, say) is equal to 1. 
One sees easily, using the prime number theorem, that P’ does not have a 
natural density and on the other hand Bomsier! has shown me a proof that 
the analytic density of P' exists (it is equal to log ,)2 = 0.301029995... ). 

However, this “pathology’”’ does not occur for the sets of prime numbers 
considered above: the set of p¢ P such that p = a (mod m) has a natural 
density (equal to 1/é(m), if a is prime to m); the same holds for the sets 
denoted P,, P;, and Q in the preceding section. For a proof (and an estimate 
of the “‘error term’’) see K. PRACHAR, Primzahlverteilung, V, §7. 


Chapter VII 


Modular Forms 


Sl. The modular group 


1.1. Definitions 


Let H denote the upper half plane of C, i.e. the set of complex numbers 
z whose imaginary part /m(z) is >0. 
Let SL,(R) be the group of matrices (? ) , with real coefficients, such 


that ad—bc = 1. We make SL,(R) act on C= CuU {co} in the following 
way: 


ifg = (? ;) is an element of SL,(R), and if z C, we put 


_ az+b 
cz+d_ 
One checks easily the formula 
Im(z) 
l Im(gz) = — , 
(1) (gz) z+? 


This shows that H is stable under the action of SL,(R). Note that the element 


-l= ( 7 ; 1) of SL,(R) acts trivially on H. We can then consider that 


it is the group PSL,(R) = SL,(R)/{+1} which operates (and this group 
acts faithfully—one can even show that it is the group of all analytic auto- 
morphisms of H). 


Let SL,(Z) be the subgroup of SL,(R) consisting of the matrices with 
coefficients in Z. It is a discrete subgroup of SL,(R). 


Definition 1.—The group G = SL,(Z)/{+1} is called the modular group; 
it is the image of SL,(Z) in PSL,(R). 


Ifg = (? ) is an element of SL,(Z), we often use the same symbol to 


denote its image in the modular group G. 


1.2. Fundamental domain of the modular group 


Let S and T be the elements of G defined respectively by (; 7 ) and 


0 
1 1 
(j ) . One has: 
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Sz = 
S?=1, (ST)3 = 1 


On the other hand, let D be the subset of H formed of all points z such 


that |z| 2 1 and |Re(z)| S$ 1/2. The figure below represents the transforms 
of D by the elements: 


{1, T, TS, ST~'S, S, ST, STS, T~'S, T~'} of the group G. 
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Fig. 1 


We will show that D is a fundamental domain for the action of G on the 
half plane H. More precisely: 


Theorem 1.—(1) For every z € H, there exists g€G such that gze D. 

(2) Suppose that two distinct points z, z’ of D are congruent modulo G. 
Then, R(z) = +4 and z = 2z'+1, or |z| = 1 andz’' = —1/z. 

(3) Let ze D and let I(z) = {glg eG, gz = z} the stabilizer of z in G. 
One has I(z) = {1} except in the following three cases: 

z = I, in which case I(z) is the group of order 2 generated by S; 

z= p = e?""3. in which case I(z) is the group of order 3 generated by ST; 

z= —p = e*">, in which case I(z) is the group of order 3 generated by TS. 


Assertions (1) and (2) imply: 


Corollary.—7he canonical map D — H/G is surjective and its restriction 
to the interior of D is injective. 


Theorem 2.—The group G is generated by S and T. 


Proof of theorems | and 2.—Let G’ be the subgroup of G generated by 


S and 7, and let z e H. We are going to show that there exists g’ ¢ G’ such 
that g'z « D, and this will prove assertion (1) of theorem |. If g = (? ) 


is an element of G’, then 


(1) Im(gz) = 2m) 


|cez+d|?_ 
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Since c and d are integers, the numbers of pairs (c, d) such that |cz+d] is 
less than a given number is finite. This shows that there exists g « G’ such 
that Jm(gz) is maximum. Choose now an integer n such that 7"gz has real 
part between —4 and +4. The element z’ = 7"gz belongs to D; indeed, it 
suffices to see that |z’| 2 1, but if |z’| < 1, the element —1/z’ would have 
an imaginary part strictly larger than /m(z’), which is impossible. Thus the 
element g’ = 7"g has the desired property. 

We now prove assertions (2) and (3) of theorem 1. Let ze D and let 


2 -(° } eG such that gz  D. Being free to replace (z, g) by (gz, 7°), 


we may suppose that Jm(gz) 2 Im(z), i.e. that |cz+d]| is $1. This is clearly 
impossible if {c| 2 2, leaving then the cases c = 0, 1, —1. If c = 0, we have 
d= +1 and g is the translation by +5. Since R(z) and R(gz) are both 
between — 4 and 4, this implies either 6 = 0 and g = 1 or b = +1 in which 
case one of the numbers R(z) and R(gz) must be equal to — 4 and the other 
to 4. If c = 1, the fact that |z+d/ is <1 implies d = 0 except if z = p (resp. 
— 6) in which case we can have d = 0, | (resp, d = 0, —1). The case d = 0 
gives |z| < 1 hence |z| = 1; on the other hand, ad—bc = 1 implies b = —1, 
hence gz = a—1/z and the first part of the discussion proves that a = 0 
except if R(z) = +4, 1.e. if z = por —f in which case we have a = 0, —1 or 
a =0, 1. The case z = p, d = | givesa—b = | and gp = a—I/(1+p) = a+p, 
hence a = 0,1; we argue similarly when z = —6,d = —1. Finally the case 
c = —I leads to the case c = | by changing the signs of a, b, c, d (which 
does not change g, viewed as an element of G). This completes the verific- 
ation of assertions (2) and (3). 

It remains to prove that G’ = G. Let g be an element of G. Choose a 
point Z, interior to D (for example z,. = 2i), and let z = gz). We have seen 
above that there exists g’ ¢ G’ such that g’z e D. The points Zz) and g’z = 
g'gz, of D are congruent modulo G, and one of them is interior to D. 
By (2) and (3), it follows that these points coincide and that g’g = 1. Hence 
we have g « G’, which completes the proof. 


Remark.—One can show that <S, 7; S?, (ST)*> is a presentation of G, 
or, equivalently, that G is the free product of the cyclic group of order 2 
generated by S and the cyclic group of order 3 generated by ST. 


§2. Modular functions 
2.1. Definitions 
Definition 2.—Let k be an integer. We say a function f is weakly modular 
of weight 2k" if f is meromorphic on the half plane H and verifies the relation 


(2) f(z) = (ez+d)" "F (2+) forall (< 2) e SL,(Z). 


CZ + 


1) Some authors say that f is ‘‘of weight — 2k", others that f is “of weight &”. 
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“) We have d(gz)/dz = (cz+d)~?. The 


Let g be the image in G of (? d 


relation (2) can then be written: 


f(gz) _ (agz)\~ 
fe) \ az 


Or 


(3) f(gz)d(gz)* = f(z)dz*. 


It means that the “‘differential form of weight k” f(z)dz* is invariant under G. 
Since G is generated by the elements S and T (see th. 2), it suffices to check 
the invariance by S and by 7. This gives: 


Proposition 1.—Let f be meromorphic on H. The function f is a weakly 
modular function of weight 2k if and only if it satisfies the two relations: 


(4) f(zZ+1) = fl) 
(5) f(-1z) = 27"f(2). 


Suppose the relation (4) is verified. We can then express f as a function 
of q = e?*'?, function which we will denote by f; it is meromorphic in the 
disk |g| < 1 with the origin removed. If f extends to a meromorphic (resp. 
holomorphic) function at the origin, we say, by abuse of language, that f 
is meromorphic (resp. holomorphic) at infinity. This means that f admits a 
Laurent expansion in a neighborhood of the origin 


{[@= + ang” 


where the a, are zero for n small enough (resp. for n < Q). 


Definition 3.—A weakly modular function is called modular if it is mero- 
morphic at infinity. 

When f is holomorphic at infinity, we set f(0o) = f(0). This is the value 
of f at infinity. 


Definition 4.—A modular function which is holomorphic everywhere 
(including infinity) is called a modular form; if such a function is zero at 
infinity, it is called a cusp form (‘‘Spitzenform” in German—“‘forme para- 
bolique”’ in French). 

A modular form of weight 2k is thus given by a series 


6) fle) = ¥ ag" = Y° aye 


which converges for |q| <1 (i.e. for /m(z) > 0), and which verifies the 
identity 
(5) S(=1]z) = 27*f(2). 


It is a cusp form if a) = 0. 
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Examples 

1) If fand f’ are modular forms of weight 2k and 2k’, the product ff’ is a 
modular form of weight 2k+2k’. 

2) We will see later that the function 


q [] ('-q")** = q— 2447 +252q? —1472q* +... 
n=1 
is a cusp form of weight 12. 
2.2. Lattice functions and modular functions 


We recall first what is a /attice in a real vector space V of finite dimension. 
It is a subgroup T° of V verifying one of the following equivalent conditions: 


1) I’ 1s discrete and V/T is compact; 
li) I‘ 1s discrete and generates the R-vector space V; 
i) There exists an R-basis (e,,...,¢,) of V which is a Z-basis of I (i.e. 
l = Ze, ®... @ Ze,). 

Let & be the set of lattices of C considered as an R-vector space. Let M 
be the set of pairs (w,, w,) of elements of C* such that Jm(w,/w,) > 0; to 
such a pair we associate the lattice 


P(w,, w,) = Zw, © Zw, 
with basis {w,, w,}. We thus obtain a map M — & which is clearly surjective. 


Let g = (? 2) e SL,(Z) and let (w,, w,) ¢ M. We put 


Wy} = aw, + bw, and Ww = cw, + dus. 
It is clear that {w;, w,} is a basis of '(w,, w,). Moreover, if we set z = w,/w, 
and z’ = w,/w,, we have 
, az+b 
££ = -_ = 
cz+d 


gz. 


This shows that Jm(z’) > 0, hence that (w;, w5) belongs to M. 


Proposition 2.—For two elements of M to define the same lattice it is 
necessary and sufficient that they are congruent modulo SL,(Z). 

We just saw that the condition is sufficient. Conversely, if (w,, w,) and 
(w,, 3) are two elements of M which define the same lattice, there exists 


, , ab ; , 
an integer matrix g = (? i} of determinant +1 which transforms the first 


basis into the second. If det(g) was <0, the sign of Jm(w;/w;) would be 
the opposite of Im(w,/w.) as one sees by an immediate computation. The 
two signs being the same, we have necessarily det(g) = 1 which proves the 
proposition. 

Hence we can identify the set & of lattices of C with the quotient of M 
by the action of SL,(Z). 
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Make now C* act on & (resp. on M) by: 


Pre AP (resp. (w,, w2)h+ (Aw,, Aw,)), AEC*. 


The quotient M/C* is identified with H by (w,, w,)h+ Zz = w,/w,, and 
this identification transforms the action of SL,(Z) on M into that of 
G = SL,(Z)/{+1} on H (cf. n° 1.1). Hence: 


Proposition 3.—The map (w,, w,) ++ w,/w, gives by passing to the quotient, 
a bijection of #/C* onto H/G. (Thus, an element of H/G can be identified 
with a lattice of C defined up to a homothety.) 


Remark.—Let us associate to a lattice I’ of C the elliptic curve Er = C/T. 
It is easy to see that two lattices I’ and I’’ define isomorphic elliptic curves 
if and only if they are homothetic. This gives a third description of H/G = 
AR/C*: it is the set of isomorphism classes of elliptic curves. 


Let us pass now to modular functions. Let F be a function on &, with 
complex values, and let k « Z. We say that F is of weight 2k if 


(7) F(Ar) = A~**F(L) 


for all lattices [ and all Ae C*. 
Let F be such a function. If (w,, w,) ¢ M, we denote by F(w,, w,) the 
value of F on the lattice ['(w,, w,). The formula (7) translates to: 


Moreover, F(w,, w>) is invariant by the action of SL,(Z) on M. 
Formula (8) shows that the product w3?*F(w,, w,) depends only on 
z = w,/w,. There exists then a function f on H such that 


(9) F(w,, w2) = wz *f(w,/a). 
Writing that F is invariant by SL,(Z), we see that / satisfies the identity: 


(2) f(z) = (ez+d) *F az +b for all ( 7 b e SL,(Z). 
cz+d c da 

Conversely, if f verifies (2), formula (9) associates to it a function F on & 

which is of weight 2k. We can thus identify modular functions of weight 2k 

with some Jattice functions of weight 2k. 


2.3. Examples of modular functions; Eisenstein series 


Lemma 1.—Let 1 be a lattice in C. The series X’ 1/|y|’ is convergent for 
a>2 | vel 

(The symbol =’ signifies that the summation runs over the nonzero 
elements of I.) 

We can proceed as with the series 21/n*, 1.e. majorize the series under 


consideration by a multiple of the double integral | leery 
x 


-- extended 
+ y*)7/? 
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over the plane deprived of a disk with center 0. The double integral is easily 
computed using “polar coordinates’’. Another method, essentially equivalent, 
consists in remarking that the number of elements of I such that [y| is 
between two consecutive integers n and n+I 1s O(n); the convergence of the 


series is thus reduced to that of the series Z1/n’~'. 


Now let k be an integer > 1. If T is a lattice of C, put 
(10) GT) = )" 1/y**. 
yer 


This series converges absolutely, thanks to lemma |. It is clear that G, is 
of weight 2k. It is called the Eisenstein series of index k (or index 2k following 
other authors). As in the preceding section, we can view G, as a function on 
M, given by: 

’ l 
I] G,(w,, #,) = )° ————_—__ . 
(11) x(@1, @2) d (mo, tne, 
Here again the symbol =’ means that the summation runs over all pairs of 
integers (m, n) distinct from (0, 0). The function on H corresponding to G, 
(by the procedure given in the preceding section) is again denoted by G,. 
By formulas (9) and (11), we have 

' l 
12) G,(z) = —___.. , 
( i) d (mz+n)** 

Proposition 4.—Let k be an integer >1. The Eisenstein series G,(z) is a 
modular form of weight 2k. We have G,(oo) = 2¢(2k) where ¢ denotes the 
Riemann zeta function. 

The above arguments show that G,(z) is weakly modular of weight 2k. 
We have to show that G, is everywhere holomorphic (including infinity). 
First suppose that z is contained in the fundamental domain D (cf. n° 1.2). 
Then 


m?zZ +2mnR(z) +n? 
2 


|mz+n|? 


IV 


m? —mn+n? = |mp—n|’. 


By lemma 1, the series '1/|mp—n|?* is convergent. This shows that the 
series G,(z) converges normally in D, thus also (applying the result to G,(g~ 'z) 
with g € G) in each of the transforms gD of D by G. Since these cover H 
(th. 1), we see that G, is holomorphic in H. It remains to see that G, is 
holomorphic at infinity (and to find the value at this point). This amounts 
to proving that G, has a limit for Jm(z) — oo. But one may suppose that z 
remains in the fundamental domain D; in view of the uniform convergence 
in D, we can make the passage to the limit term by term. The terms 
I /(mz+n)?* relative to m + 0 give 0; the others give 1/n2*. Thus 


lim.G,(z) = )'1/n?* = 2 1/n?* = 22(2k) —q.e.d. 
n= 1 


Remark.—We give in n° 4.2 below the expansion of G, as a power series 
ing = e7*!?, 
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Examples.—The Eisenstein series of lowest weights are G, and G3, 
which are of weight 4 and 6. It is convenient (because of the theory of 
elliptic curves) to replace these by multiples: 


(13) £2 = 60G,, 83 = 140G,. 


We have g,(0o) = 1202(4) and g3(cc) = 2802(6). Using the known values 
of ¢(4) and &(6) (see for example n° 4.1 below), one finds: 


4 8 
(14) g2(0) = 37° 23(00) = mv 7°. 
If we put 
(15) A = g3-27g}, 


we have A(oo) = OQ; that is to say, A is a cusp form of weight 12. 


Relation with elliptic curves 
Let T° be a lattice of C and let 


] , l l 
16 uy =—4+S'(— = 
(16) pr) = 5+ (ay: 7) 
be the corresponding Weierstrass function", The G,(I’) occur into the 
Laurent expansion of (,-: 


(17) ((u) = — + y (2k —1)G,(T)u2*-?, 


If we put x = @,(u), y = (uu), we have 
(18) y? = 4x°—g,x—g,, 


with g, = 60G,(T), g3; = 140G,(1) as above. Up to a numerical factor, 
A = g}—27g3 is equal to the discriminant of the polynomial 4x? — g,x—g3. 

One proves that the cubic defined by the equation (18) in the projective 
plane is isomorphic to the elliptic curve C/T’. In particular, it is a nonsingular 
curve, and this shows that A is +0. 


§3. The space of modular forms 
3.1. The zeros and poles of a modular function 


Let f be a meromorphic function on H, not identically zero, and let p 
be a point of H. The integer n such that //(z—p)" is holomorphic and non- 
zero at p is called the order of f at p and is denoted by »,(/). 


“) See for example H. CaRTAN, Théorie élémentaire des fonctions analytiques d’une ou 
plusieurs variables complexes, chap. V, §2, n° 5. (English translation: Addison-Wesley Co.) 
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When f is a modular function of weight 2k, the identity 


fiz) = cxtays(S07) 
cz+d 

shows that v,(f) = v,)(f) if g ¢G. In other terms, v,(/) depends only on 
the image of p in H/G. Moreover one can define v,,(f) as the order for 
qg = 0 of the function f(q) associated to f (cf. n° 2.1). 

Finally, we will denote by e, the order of the stabilizer of the point p; 
we have e, = 2 (resp. e, = 3) if p is congruent modulo G to i (resp. to p) 
and e, = | otherwise, cf. th. 1. 


Theorem 3.—Let f be a modular function of weight 2k, not identically 
zero. One has: 

l k 

(19) vf) + Y —v(f)=-. 

PEHIG Cy 6 

[We can also write this formula in the form 


(20) rol) +5 0(f) + of) + Y* o(f) = « 
2 3 peH|G 6 


where the symbol &* means a summation over the points of H/G distinct 
from the classes of i and p.] 

Observe first that the sum written in th. 3 makes sense, i.e. that f has 
only a finite number of zeros and poles modulo G. Indeed, since f is mero- 
morphic, there exists r > 0 such that f has no zero nor pole for 0 < |q| < r; 


this means that fhas no zero nor pole for Jm(z) > 5 log (1/r). Now, the part 
TT 

] 

20 


(l/r) is compact; since fis meromorphic in H, it has only a finite number of 
zeros and of poles in D,, hence our assertion. 


df 


To prove theorem 3, we will integrate - I = on the boundary of D. More 
precisely : din f 


D, of the fundamental domain D defined by the inequality Jm(z) < —log 


1) Suppose that fhas no zero nor poleon the boundary of Dexcept possibly 
i, p, and — . There exists a contour @ as represented in Fig. 2 whose interior 
contains a representative of each zero or pole of f not congruent to i or p. 
By the residue theorem we have 
1 { df + 
dni | f ho AS 
€ 
On the other hand: 
a) The change of variables g = e?*'? transforms the arc EA into a circle w 
centered at q = 0, with negative orientation, and not enclosing any zero or 
pole of f except possibly 0. Hence 
A 
1 jdf 1 [df _ 


om | fm lf —v,(f). 
E w 
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Fig. 2 


7 


b) The integral of I fon the circle which contains the arc BB’, oriented 
i 
negatively, has the value —v,(f). When the radius of this circle tends to 0, 


oN 2m 
the angle BB’ tends to S. Hence: 


c 

1{F,_! 

el 7 5) 
c 

I (af l 

rl Weer 
D 


d) S transforms the arc B’C onto the arc DC’; since f(Sz) = z7*f(z), we 
get: 

df(S2z) _ 5,4 | afte) 

f(Sz) z f(z)’ 
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hence: 


I * af l "aif 4 *(dflz) df(Sz) 
2in \4 + 3m | fin JV FS 
B’ Cc’ 


B’ 
Cc 


l dz 
-a5 | (-% =) 
J 


l k 
—2k{ —-—)=- 
~ ( ) 6 
when the radii of the arcs BB’, CC’, DD’, tend to 0. 


Writing now that the two expressions we get for - | 4 are equal, and 
[17 


passing to the limit, we find formula (20). . 


2) Suppose that f has a zero or a pole A on the half line 
{21Rete) = — * , Im(z) > vt. 


We repeat the above proof with a contour modified in a neighborhood of A 
and of 7A as in Fig, 3. (The arc circling around 7A is the transform by T of 
the arc circling around A.) 


-TA 


-1 -% 0 4 1 


Fig. 3 


We proceed in an analogous way if f has several zeros or poles on the 
boundary of D. 


Remark.—This somewhat laborious proof could have been avoided if 
one had defined a complex analytic structure on the compactification of H/G 
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(see for instance Seminar on Complex Multiplication, Lecture Notes on Math., 
n° 21, lecture IT). 


3.2. The algebra of modular forms 


If k is an integer, we denote by M, (resp. M9) the C-vector space of 
modular forms of weight 2k (resp. of cusp forms of weight 2k) cf. n° 2.1, 
def. 4. By definition, M° is the kernel of the linear form fr (00) on M,. 
Thus we have dim M,/M? S 1. Moreover, for k 2 2, the Eisenstein series 
G, is an element of M, such that G,(0o) + 0, cf. n° 2.3, prop. 4. Hence we 
have 

M, = M2 @C.G, (fork = 2). 


Finally recall that one denotes by A the element g}—27g2 of M@ where 


Theorem 4.—(i) We have M, = 0 fork < 0 and k = 1. 

(ii) Fork = 0, 2, 3, 4, 5, M, is a vector space of dimension | with basis |, 
G,, G3, G4, G5; we have M® = 0. 

(iii) Multiplication by A defines an isomorphism of M,- onto MQ. 

Let f be a nonzero element of M,. All the terms on the left side of the 
formula 


(20) vo(f) + vf ) + Ane] )+ Y* off) = k 
peH|G 6 


are >0. Thus we have k 2 0 and also k + 1, since 4 cannot be written in 
the form n+n’'/2+n"/3 with n,n’, n" 2 0. This proves (i). 

Now apply (20) to f = G,, k = 2. We can write 2 in the form n+n'/2 
+n"/3,n,n',n" 2 0 only forn = 0, n' = 0, n" = 1. This shows that v,(G;) 
= 1 and v,(G,) = 0 for p + p (modulo G). The same argument applies to 
G, and proves that v(G3) = 1 and that all the others v,(G3) are zero. This 
already shows that A is not zero at i, hence is not identically zero. Since the 
weight of A is 12 and v,(A) 2 1, formula (20) implies that v,(A) = 0 for 
p + © and v,(A) = I. In other words, A does not vanish on H and has a 
simple zero at infinity. If f is an element of M? and if we set g = f/A, it is 
clear that g is of weight 2k —12. Moreover, the formula 


vf) if p + © 

v(f)—1 ifp = 0c 

shows that v,(g) is 20 for all p, thus that g belongs to M, _ ,, which proves (ili). 
Finally, if k < 5, we have k—6 <0 and M? = 0 by (i) and (iii); this 

shows that dim M, S$ 1. Since 1, G,, G3, G4, Gs; are nonzero elements of 

My, M2, M;, M4, Ms, we have dim M, = 1 for k = 0, 2, 3, 4, 5, which 

proves (ii). 


b(g) = v,(f)—»,(A) = 


Corollary 1.—We have 
(21) dim M, = [k/6] ifk = 1(mod 6),k 2 0 
[k/6]J+1 ifk # 1 (mod 6), k 2 0. 
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(Recall that [x] denotes the integral part of x, i.e. the largest integer 1 such 
thatn S x.) 

Formula (21) is true for 0 S$ k < 6 by (i) and (ii). Moreover, the two 
expressions increase by one unit when we replace k by k+6 (cf. (ii)). The 
formula is thus true for all k 2 0. 


Corollary 2.—The space M, has for basis the family of monomials G3G5 
with «, B integers 20 and 2a+3B = k. 

We show first that these monomials generate M,. This is clear for k $ 3 
by (i) and (ii). For k 2 4 we argue by induction on k. Choose a pair (y, 5) 
of integers 20 such that 2y+38 = k (this is possible for all k 2 2). The 
modular form g = G3G} is not zero at infinity. If fe M,, there exists 4 ¢C 
such that f—Ag is a cusp form, hence equal to Ah with he M,_., cf. (ili). 
One then applies the inductive hypothesis to h. 

It remains to see that the above monomials are linearly independent; 
if they were not, the function G3/G? would verify a nontrivial algebraic 
equation with coefficients in C, thus would be constant, which is absurd 
because G, is zero at p but not G3. 


Remark.—Let M = 7 M, be the graded algebra which 1s the direct 


sum of the M, and let e: C[X, Y] > M be the homomorphism which maps 
X on G, and Y on G;. Cor. 2 is equivalent to saying that e is an isomorphism. 
Hence, one can identify M@ with the polynomial algebra C[G, G3}. 


3.3. The modular invariant 


We put: 
(22) j = 1728g3/A. 


Proposition 5.—(a) The function j is a modular function of weight 0. 

(b) Jt is holomorphic in H and has a simple pole at infinity. 

(c) /t defines by passage to quotient a bijection of H/G onto C. 

Assertion (a) comes from the fact that g3 and A are both of weight 12; 
(b) comes from the fact that A is +0 on H and has a simple zero at infinity, 
while g, is nonzero at infinity. To prove (c), one has to show that, if A eC, 
the modular form f, = 1728g3—AA has a unique zero modulo G. To see 
this, one applies formula (20) with f = f, and k = 6. The only decompositions 
of k/6 = 1 in the form n+n'/24+n°/3 with n, n’, n” 2 0 correspond to 


(n, n’, n") = (1,0, 0) or (0, 2, 0) or (0, 0, 3). 
This shows that f, is zero at one and only one point of H/G. 


Proposition 6.—Let f be a meromorphic function on H. The following 
properties are equivalent: 


(i) f is a modular function of weight 0; 
(ii) f is a quotient of two modular forms of the same weight; 
(iit) f is a rational function of j. 
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The implications (iii) => (11) => (1) are immediate. We show that (i) => (iii). 
Let f be a modular function. Being free to multiply f by a suitable polynomial 
in j, we can suppose that f is holomorphic on H. Since A is zero at infinity, 
there exists an integer n 2 0 such that g = A’f is holomorphic at infinity. 
The function g is then a modular form of weight 12”; by cor. 2 of theorem 4 
we can write it as a linear combination of the G3G with 2«+ 38 = 6n. By 
linearity, we are reduced to the case g = G3G}, i.e. f = G3G3/A". But the 
relation 2«+38 = 6n shows that p = a/2 and q = 8/3 are integers and one 
has f = G}?G34/A?**. Thus we are reduced to see that G3/A and G3/A are 
rational functions of j, which is obvious. 


Remarks.—1) As stated above, it is possible to define in a natural way 
iia’ 
a structure of complex analytic manifold on the compactification H/G of 


H/G. Prop. 5 means then that j defines an isomorphism of ae onto the 
Riemann sphere S, = CU{oo}. As for prop. 6, it amounts to the well 
known fact that the only meromorphic functions on S, are the rational 
functions. 

2) The coefficient 1728 = 2°3° has been introduced in order that j has 
a residue equal to | at infinity. More precisely, the series expansions of §4 
show that: 


(23) j(z) = ! + 744 + y c(n)q", ze H,g = e?"". 
q n=1 


One has: 
c(1) = 2737 1823 = 196884, c(2) = 2''5.2099 = 21493760. 

The c(m) are integers; they enjoy remarkable divisibility properties''?: 
n = 0(mod 2%) = c(n) = 0(mod 2°7*8) = ifa>] 
n = 0(mod 3%) = c() = 0 (mod 3747 3) ” 
n = 0(mod 5%) => c(n) = 0 (mod 5**') ” 
n = 0(mod 7") = c(n) = 0 (mod 7°) 
n = 0(mod 11°) = c(n) = 0 (mod 11°). 


$4. Expansions at infinity 
4.1. The Bernoulli numbers B, 
They are defined by the power series expansion:‘?? 


“) See on this subject A. O. L. ATKIN and J. N. O'BRIEN, Trans. Amer. Math. Soc., 
126, 1967, as well as the paper of ATKIN in Computers in mathematical research (North 
Holland, 1968). 


‘2) In the literature, one also finds **Bernoulli numbers’? 6, defined by 


~_= DV bxtik!, 
ex—1 = 
hence bo = l, b, = — 1/2, bona =QOifk> I, and ba. = (—1)*-'B,. 


The 6 notation is better adapted to the study of congruence properties, and also to general- 
izations a la Leopoldt. 
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x 
24 =1-74 —1)**'B 
(24) a 5 y (-1)"'B, OB 
Numerical table 
l l l ] 5 691 
B,=-,B8B, =—,B,=—, Bh =—, 8B =—,; = —— 
16’? 30’ 2 42’? * 30" 66” *—s2730 
7 3617 43867 283.617 11.131.593 
B, =—, By = —~, By = ——,, By = ——_ ; 8 = — >> 
6 510 798 330 138 
103.2294797 13.657931 7.9349.362903 
B,, = ———__ , 3 0= —,, By = ————_.. 
2730 6 870 


The B, give the values of the Riemann zeta function for the positive even 
integers (and also for the negative odd integers): 


Proposition 7.—/f k is an integer 21, then: 


k-1 
25 2k B,n?*. 
( ) c( )= Ob qT 
The identity 
96 , a0 B J2ko2k 
tez=1—- 
(26) 70082 x Fe ip: 


follows from the definition of the B, by putting x = 2iz. Moreover, taking 
the logarithmic derivative of 


a) z2 
(27) sinz = 2 [1 (1 aa) 
= n? nw 
we get: 
7? 
(28) zcotgz = 1+2 y= 
n= 127 —pn? a? 


Comparing (26) and (28), we get (25). 


4 6 
E I 2 a 4 = ad 5 6 = ” ) 
xamples {(2) = 23 » O(4) 7325 (6) 33.5.7 
ari 9 6912 
8 ] => ~~ 46 «39721112? 
C(8) = 33 233.527 » (10) = 35.5.7.11 » 812) 3°.59.77.11.13 
14 
4) = — 2" 


39,52.7.11.13 | 
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4.2. Series expansions of the functions G, 


We now give the Taylor expansion of the Eisenstein series G,(z) with 
respect tog = e2"", 
Let us start with the well known formula: 


(29) 7 cotg mz = ! +) ( + a) 
Zz m=1\Z+m z—-m 
We have on the other hand: 
(30) mcotg az = 7 OS® me = init! = |r — fin = in —2in y q", 
SIN 7Z q—-1 l-—g n=0 


Comparing, we get: 


31) i+ 3 (sy 


By successive differentiations of (31), we obtain the following formula 
(valid for k 2 2): 


yim. z—m 


Jt —_- < k-1l1n 
©) on (m+2~ k- pi? oy me 


Denote now by o,(n) the sum 2 d* of kth-powers of positive divisors of n. 


Proposition 8.—¥For every integer k = 2, one has: 


ae Dy Pan— 1079". 


(33) G,(z) = 24(2k)+2 


We expand: 
] 


G,(z) = 2 
* (n, m)#=-(0,0) (nz +m)** 


2QH4+25 y --! 


n=1 mez (n nz+m)* 


Applying (32) with z replaced by nz, we get 


- 2(— 2ni)™ ¥ kg 
G,(z) = 2¢(2k) + (Ok—1yt ah x d 

_ 2(2ni)** 2 h 

= 20(2k) + Qk—1! >» 72,-1(1)q"- 


Corollary.—-G,(z) = 2¢(2k)E,(z) with 


(34) E,(z) = 1+y, 2. O24 1(71)q” 
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and 


(35) y, =(-lI'--. 


k 


One defines £,(z) as the quotient of G,(z) by 2¢(2k); it is clear that E,(z) 
is given by (34). The coefficient y, 1s computed using prop. 7: 


= Gn 1 aA ayy tt 
*  Qk=1)! 2k) (Qk—1)! 224" By? 
Examples 

E, = 1+240 }) o3(n)q", a2 = (2n)" 35 E 
a=1 

E, = 1-504 >» a5(n)q", 3 = (2z)° 5333 73 3 

E, = 14480 )° o,(n)q" (480 = 2°.3.5) 
n=1 

E, = 1-264 ¥° o(n)q" (264 = 27.3.11) 
n= 1 

E, =1+ re) d 0, ,(n)q" (65520 = 2*.3.5.7.13) 


E, =1-24 y o,3(n)q". 


Remark.—We have seen in n° 3.2 that the space of modular forms of 
weight 8 (resp. 10) is of dimension 1. Hence: 


(36) E3 = Ey, E,Es = Es. 


This is equivalent to the identities: 


nmi 
a(n) = o3(n)+120 ys a;(m)o3(n—m) 


n-1 
Llog(n) = 2los(n)—1003;(n) +5040 }° 03(n)o5(n—m). 
m= 1 
More generally, every £, can be expressed as a polynomial in E, and £3. 


4.3. Estimates for the coefficients of modular forms 


Let 


(37) f(z) = y" ang" (q = e**") 


n=O 
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be a modular form of weight 2k, k 2 2. We are interested in the growth of 
the a,: 


Proposition 9.—/f f = G,, the order of magnitude of a, is n**~'. More 
precisely, there exist two constants A, B > 0 such that 


(38) An**-! < Ja,| < Bn**-'. 
Prop. 8 shows that there exists a constant A > O such that 
a, = (—1)*Aoy,-,(n), hence |a,| = Ac,,-,(n) 2 An?*~!. 
On the other hand: 
Maal =AY _ <A y a = Al(2k—-1) < +0. 
d d 


d=1 


Theorem 5 (Hecke).—/f f is a cusp form of weight 2k, then 
(39) a, = O(n"). 
la, 


(In other words, the quotient —~ remains bounded when n — oo.) 
n 
Because f is a cusp form, we have a, = 0 and can factor q out of the 
expansion (37) of /, Hence: 


(40) =| f(z)| = O(g) = O(e~2"”) with y = Im(z), when tends to 0. 


Let 4(z) = | f(z)|y*. Formulas (1) and (2) show that ¢ is invariant under 
the modular group G. In addition, ¢ is continuous on the fundamental 
domain D and formula (40) shows that ¢ tends to 0 for y — oo. This implies 
that ¢ is bounded, i.e. there exists a constant M such that 


(41) If(z)| S$ My" forze H. 


Fix y and vary x between 0 and 1. The point g = e?***™ runs along a 
circle C, of center 0. By the residue formula, 


i 
! _ a 
a= | fe)q7""'dq = | flxtiy)q7"de. 
27 
¢, 6 


(One could also deduce this formula from that giving the Fourier coefficients 
of a periodic function.) 
Using (41), we get from this 


la,| < My~*e?™”, 


This inequality is valid for all y > 0. For y = I/n, it gives |a,| S$ e?"Mn"*. 
The theorem follows from this. 


Corollary.—/f f is not a cusp form, then the order of magnitude of a, 
. 2k-1 
iS n . 


We write fin the form AG,+A with A + 0 and a cusp form / and we 
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apply prop. 9 and th. 5, taking into account the fact that n* is “negligible” 
compared to n7*~!. 


Remark.—The exponent k of theorem 5 can be improved. Indeed, 
Deligne has shown (cf. 5.6.3 below) that 
a, = O(n*~*o9(n)), 
where o,() is the number of positive divisors of n. This implies that 


a, = O(n*~'/7**)—— forevery « > 0. 
4.4. Expansion of A 


Recall that 
A = g3—27g3 = (27)'72>°3> 9(E3— E3) 


(42) 
= (2m)'7(q—24q? +252q3 —14729* +. ..). 


Theorem 6 (Jacobi).—A = (2m)'*q M1 (1 —g")?*. 


[This formula is proved in the most natural way by using elliptic functions. 
Since this method would take us too far afield, we sketch below a different 
proof, which is “‘elementary’”’ but somewhat artificial; for more details, the 
reader can look into A. Hurwitz, Math. Werke, Bd. I, pp. 578-595.] 

We put: 


(43) Fe) = 4 T] (a. 


To prove that F and A are proportional, it suffices to show that F is a 
modular form of weight 12; indeed, the fact that the expansion of G has 
constant term zero will show that F is a cusp form and we know (th. 4) that 
the space M® of cusp forms of weight 12 is of dimension !. By prop. 1 of 
n° 2.1, all there is to do is to prove that: 


(44) F(—1/z) = z'*F(z). 


We use for this the double series 


ote H@) = DY) ———— 


m—1+n2) (m+nz) (m—1+nz) (m+nz) 
where the sign &’ indicates that (m,n) runs through all meZ, neZ with 
(m,n) # (0,0) for G and G, and (m,n) # (0,0), (1,0) for H and H,. (Notice 


the order of the summations!) 
The series H, and H are easy to calculate explicitly because of the formula: 
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l l l 


(m—1+nz)(m+nz) ~ m—1-+nz 7 M+ NZ 
One finds that they converge, and that 
H, = 2, H = 2—2zilz. 
Moreover, the double series with general term 


| l l 
(m—1+nz) (m+nz) 7 (m+nz)? — (m+nz)? (m—14n2) 


is absolutely summable. This shows that G,—H, and G—H coincide, thus 
that the series G and G, converge (with order of summation indicated) and 
that 


G,(2)— G2) = Hy@)-H@) = = 


It is clear moreover that G,(—1/z) = z?G(z). Hence: 
(45) G,(—1/z) = z7G,(z)—2ziz. 


On the other hand, a computation similar to that of prop. 8 gives 
(46) G,(z) = — — 87? >> o,(n)q". 
Now, go back to the function F defined by (43). Its logarithmic differential is 


(47) < = 4 (1- 74 s 
q m= 


ng m) = a (1-24 3 y o,(n)q") . 


By comparing with (46), we get: 


(48) <= = Gy(e)dz 


Combining (45) and (48), we have 


aF(—A/z) _ Oi 6 ¢_ 4), % _ Sb az 7 
F(o1)) > G,(—1/z) 2p 5 (2°G,(z) — 2ziz) 
” _aF (2) , 494 
F(z) Zz 


Thus the two functions F(—1/z) and z'?F(z) have the same logarithmic 
differential. Hence there exists a constant k such that F(—1/z) = kz!?F(z) 
for all z¢ H. For z =i, we have z'* = 1, —1/z = z and F(z) + 0; this 
shows that k = 1, which proves (44), q.e.d. 


Remark.—One finds another ‘“‘elementary” proof of identity (44) in 
C. L. SIEGEL, Gesamm. Abh., III, n° 62. See also Seminar on complex multi- 
plication, III, §6. 
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4.5. The Ramanujan function 


We denote by 7() the th coefficient of the cusp form F(z) = (27) '? A(z). 
Thus 


(50) d, r(nq" = 4 [] 4"). 
The function n+ 7(n) is called the Ramanujan function. 


Numerical table © 


r(1) = 1, 7(2) = —24, 7(3) = 252, 7(4) = —1472, 7(5) = 4830, 
1(6) = —6048, 7(7) = —16744, 7(8) = 84480, 7(9) = —113643, 
7(10) = —115920, r(11) = 534612, 7(12) = —370944. 


Properties of +(n) 
(51) r(n) = O(n’), 


because A is of weight 12, cf. n° 4.3, th. 5. (By Deligne’s theorem, we even 
have z(n) = O(n''/***) for every « > 0.) 


($2) (nm) = r(n)t(m)_ if (n,m) = 1 

(53) r(p"*') = +(p)r(p")—p''1(p""') for p prime, n > 1, cf. n° 5.5. below. 
The identities (52) and (53) were conjectured by Ramanujan and first proved 
by Mordell. One can restate them by saying that the Dirichlet series 
LAs) = E x(n)/n? has the following eulerian expansion: 


I 


netomat inns 


(54) LAs) = I] W122s? cf. n° §.4. 


peP 1—r(p)p *+p 
By a theorem of Hecke (cf. n° 5.4) the function L, extends to an entire 
function in the complex plane and the function 
(27) *P(s)L(5) 
is invariant by s+> 12—s. 
The 7(n) enjoy various congruences modulo 2!7, 3°, 53, 7, 23, 691. We 
quote some special cases (without proof): 


(55) 7(n) = n*o,(n) (mod 3°) 
(56) t(n) = no;(n)_ (mod 7) 
(57) r(n) = 0,,(n) (mod 691). 


For other examples, and their interpretation in terms of “‘/-adic repre- 
sentations” see Sém. Delange-Pisot-Poitou 1967/68, exposé 14, Sém. Bourbaki 
1968/69, exposé 355 and Swinnerton—Dyer’s lecture at Antwerp (Lecture 
Notes, n° 350, Springer, 1973). 


‘) This table is taken from D. H. Lenmer, Ramanujan’s function r(n), Duke Math. J., 
10, 1943, which gives the values of r(m) for n s 300. 
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We end up with an open question, raised by D. H. Lehmer: 
Is it true that r(m) + 0 for all n > 1? 
It is so forn < 10'°. 


§5. Hecke operators 
5.1. Definition of the T(n) 


Correspondences.—Let E be a set and let X, be the free abelian group 
generated by E. A correspondence on E (with integer coefficients) is a 
homomorphism T of X;, into itself. We can give T by its values on the ele- 
ments x of E: 

(58) Tex) = Ynady, n(x) €Z, 
ye 
the n,(x) being zero for almost all y. 

Let F be a numerical valued function on £. By Z-linearity it extends to a 
function, again denoted F, on X,. The transform of F by T, denoted 7F, is 
the restriction to E of the function Fo 7. With the notations of (58), 

(59) TF(x) = F(T(x)) = 2 n(x)F (y). 
ye 

The T(n).—Let & be the set of lattices of C (see n° 2.2). Let n be an integer 
= 1. We denote by 7(n) the correspondence on & which transforms a lattice 
to the sum (in XQ) of its sub-lattices of index n. Thus we have: 

(60) Tmr= YY IY fle. 
(tT: jaa 

The sum on the right side is finite. Indeed, the lattices I’ all contain nT 
and their number is also the number of subgroups of order n of ['/nl = 
(Z/nZ)*. If n is prime, one sees easily that this number is equal to n+! 
(number of points of the projective line over a field with n elements). 

We also use the homothety operators R, (A « C*) defined by 


(61) R= flee. 


Formulas.—\t makes sense to compose the correspondences 7(n) and 
R,, since they are endomorphisms of the abelian group Xy. 


Proposition 10.—-The correspondences T(n) and R, verify the identities 


(62) R,R, = Ri, (A, we C*) 

(63) R,T(n) = T()R, (n 2 1,AeEC*) 
(64) T(m)T(n) = T(mn) if(m,n) = | 

(65) T(p")T(p) = T(p"*')+pT(p""")R, — (p prime, n = 1). 


Formulas (62) and (63) are trivial. 
Formula (64) is equivalent to the following assertion: Let m, n be two 
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relatively prime integers 21, and let I” be a sublattice of a lattice I° of 
index mn; there exists a unique sublattice I’ of I’, containing I”, such that 
((:P’) = nand (I’:I”) = m. This assertion follows itself from the fact that 
the group ['/I”, which is of order mn, decomposes uniquely into a direct 
sum of a group of order m and a group of order n (Bezout’s theorem). 

To prove (65), let IT be a lattice. Then 7(p")T(p)T, T(p"**)f and 
T(p"~")R,I are linear combinations of lattices contained in I and of index 
p"*' in T' (note that R,I is of index p? in ©). Let I” be such a lattice; in the 
above linear combinations it appears with coefficients a, b, c, say; we have 
to show that a = b+pc, i.e. that a = 1+>pc since b is clearly equal to 1. 

We have two cases: 

i) I” is not contained in pI’. Then c = 0 and a is the number of lattices I’, 
intermediate between I’ and I’, and of index p in T; such a lattice I’ contains 
pl. In [/pP the image of I’ is of index p and it contains the image of I’ 
which is of order p (hence also of index p because I'/pT is of order p’); 
hence there is only one [°’ which does the trick. This gives a = | and the 
formula a = 1+ pce 1s valid. 

ii) L” < pl’. We have c = 1; any lattice I’ of index p in I contains pI’, 
thus a fortiori T”. This gives a = p+1 and a = 1+ pc is again valid. 


Corollary 1.—The T(p"), n > 1, are polynomials in T(p) and R,. 
This follows from (65) by induction on a. 


Corollary 2.—The algebra generated by the R, and the T(p), p prime, is 
commutative; it contains all the T(n). 
This follows from prop. 10 and cor. 1. 


Action of T(n) on the functions of weight 2k. 
Let F be a function on & of weight 2k (cf. n° 2.2). By definition 
(66) R,F=X **F  forallAe C*. 
Let n be an integer 21. Formula (63) shows that 

R,(T(n)F) = T(n) (R,F) = A*T(n)F, 

in other words T(n)F is also of weight 2k. Formulas (64) and (65) give: 
(67) T(m)T(n)F = T(mn)F if(m, n) = 1, 
(68) = T(p)T(p")F = T(p"**)F+p'~**T(p""')F, — p prime, n 2 1. 


5.2. A matrix lemma 


Let I’ be a lattice with basis {w,, w,} and let ” be an integer 21. The 
following lemma gives all the sublattices of I’ of index n: 


Lemma 2.—Let S, be the set of integer matrixes (5 ) with ad = n, 


azlOQsb<diIfc= (5 ’) is contained in S,, let T, be the sublattice 
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of T having for basis 


Ww; — aw, + bw, Ww = dus. 


The map o+>T, is a bijection of S, onto the set V(n) of sublattices of index 
nin ld. 

The fact that [., belongs to ['(2) follows from the fact that det(o) = n. 
Conversely let °’ ¢ [(n). We put 


Y, = [/"+Ze,) and Y, = Zw,/(T' A Zw,). 


These are cyclic groups generated respectively by the images of w, and wy. 
Let a and d be their orders. The exact sequence 


0>Y, >’ > Y,>0 


shows that ad =n. If w; = dw,, then w,e¢I’. On the other hand, there 
exists w, € I’’ such that 
w, = aw, (mod Zw,). 


It is clear that w, and w, form a basis of 1°’. Moreover, we can write w, in 
the form 
w, =aw,t+bw, withbe Z, 


where b is uniquely determined modulo d. If we impose on 6 the inequality 
0 < b < d, this fixes b, thus also w,;. Thus we have associated to every 
I’ e« F(n) a matrix o(I’) ¢ S,, and one checks that the maps o++ I, and 
I’ ++ o(I’) are inverses to each other; the lemma follows. 


Example.—lf p is a prime, the elements of S, are the matrix ( i} 


and the p matrices (5 ) with O < b < p. 


5.3. Action of T(n) on modular functions 


Let & be an integer, and let f be a weakly modular function of weight 2k, 
cf. n° 2.1. As we saw in n° 2.2, f corresponds to a function F of weight 2k on 
& such that 


(69) F(P(w,09)) = wz *f(o4/w2). 


We define T(n)f as the function on H associated to the function n**~'T(n)F 
on &. (Note the numerical coefficient n7*~' which gives formulas “without 
denominators” in what follows.) Thus by definition: 


(70) T(n) f(z) = n?*-'T(n)F(L(z, 1), 

or else by lemma 2: 

(71) T(n)f(z) =n! ay (ST) | 
aZl,ad=n 
0Of2b< 


Hecke operators 101 
Proposition 11.—The function T(n)f is weakly modular of weight 2k. It 

is holomorphic on H if f is. We have: 

(72) T(m)T(n)f = T(mn) if (m,n) = 1, 

(73) T(p)T(p")f = T(p"*")ftp™*'T(p""")f, if pis prime, n = 1. 


Formula (71) shows that 7(n)f is meromorphic on H, thus weakly 
modular; if in addition f is holomorphic, so is T(n)f- Formulas (72) and 
(73) follow from formulas (67) and (68) taking into account the numerical 
coefficient n2*~' incorporated into the definition of T(n)/f. 


IV 


Behavior at infinity.—We suppose that f is a modular function, 1.e. is 
meromorphic at infinity. Let 


(74) f(z) = 2, c(m)q™ 


be its Laurent expansion with respect to g = e 


Proposition 12.—The function T(n)f is a modular function. We have 


(75) T(n)f(z) = 2, y(m)q” 
with 
(76) y(m) = ow azk- (a 


By definition, we have: 


T(n)f(z) =n?" Yd 7® ¥ cle? 7imlazt bvid 


ad=n,a21 meZ 
Osb<d 
Now the sum 
y e2ni bm/d 
O0<b<d 


is equal to dif d divides m and to 0 otherwise. Thus we have, putting m/d = m’: 


T(n)f(z) _ neko y d~***o(m'd)qa™. 
ad=n 
aZzl, m’eZ 


Collecting powers of q, this gives: 
n 2k~-1 d 
Tonfz)= Da" ¥ (3) (Mt). 
uez a|(n, u) a a 
az! 
Since f is meromorphic at infinity, there exists an integer N 2 O such that 


c(m) = Oif m Ss —N. The (“) are thus zero for » < —nN, which shows 
a 


that T(n)f is also meromorphic at infinity. Since it is weakly modular, it is a 
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modular function. The fact that its coefficients are given by formula (76) 
follows from the above computation. 


Corollary 1.— (0) = o,,_,(n)c(O) and y(1) = c(n). 
Corollary 2.—/f n = p with p prime, one has 

y(m) = c(pm) ifm # 0 (mod p) 

y(m) = c(pm)+p*"*c(m/p) ifm = 0 (mod p). 


Corollary 3.—/f f is a modular form (resp. a cusp form), so is T(n)f. 
This is clear. 


Thus, the 7(n) act on the spaces M, and M? of n° 3.2. As we saw above, 
the operators thus defined commute with each other and satisfy the identities: 


(72) T(m)T(n) = T(mn) if(m,n) = 1 
(73) T(p)T(p") = T(p’*')+p*"'T(p""") if pis prime, n 2 1. 


5.4. Eigenfunctions of the T(n) 


Let f(z) = & c(n)g” be a modular form of weight 2k, k > 0, not identically 
n=0 


zero. We assume that f is an eigenfunction of all the 7T(n), i.e. that there 
exists a complex number A(7) such that 


(77) T(n)f = A(n)f foralln 2 1. 
Theorem 7.—a) The coefficient c(1) of q in f is +0. 
b) If f is normalized by the condition c(1) = 1, then 

(78) c(n) = A(n) foralln > 1. 


Cor. 1 to prop. 12 shows that the coefficient of g in T(n)/ is c(n). On the 
other hand, by (77), it is also A(m)c(1). Thus we have c(m) = A(n)c(1). If 
c(1) were zero, all the c(n), n > 0, would be zero, and f would be a constant 
which is absurd. Hence a) and b). 


Corollary 1.—Two modular forms of weight 2k, k > 0, which are eigen- 
Junctions of the T(n) with the same eigenvalues X(n), and which are normalized, 
coincide. 

This follows from a) applied to the difference of the two functions. 


Corollary 2.—Under the hypothesis of theorem 7, b): 
(79) c(m)c(n) = c(mn) if(m,n) = ] 
(80) c(p)c(p") = c(p"**)+p**~*c(p"~"). 

Indeed the eigenvalues A(n) = c(n) satisfy the same identities (72) and 
(73) as the T(n). 


Formulas (79) and (80) can be translated analytically in the following 
manner: 
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Let 


(81) -(s) = y c(n)/n° 


be the Dirichlet series defined by the c(n); by the cor. of th. 5, this series 
converges absolutely for R(s) > 2k. 


Corollary 3.—We have: 


I 
82 ® (s) = [] —-———__~ 
( ) r(5) Lapp ap et 


By (79) the function n++ c(n) is multiplicative. Thus lemma 4 of chap. 
VII, n° 3.1 shows that ®,(s) is the product of the series aa c(p")p"*. Putting 


p * = T, we are reduced to proving the identity 


(83) > c(p")T" = _t. where ©, (7) = 1—c(p)T+p**"'T?. 
= ©, (T) 


Form the series 


MT) = (¥ e(o*yT") (1—ep)T+p*-'T?). 
The coefficient of T in is c(p)—c(p) = 0. That of T7"*', n = 1, is 


c(p"**)—e(p)c(p") + p**~'e(p"~*), 


which is zero by (80). Thus the series % is reduced to its constant term ¢c (1) = 1, 
and this proves (83). 


Remarks.—1) Conversely, forinulas (81) and (82) imply (79) and (80). 

2) Hecke has proved that ®, extends analytically to a meromorphic 
function on the whole complex plane (it is even holomorphic if f is a cusp 
form) and that the function 


(84) X,(s) = (27) ~*T(s)®,(s) 
satisfies the functional equation 
(85) Xs) = (—1)'X,(2k—s). 


The proof uses Mellin’s formula 
X,(s) = | (Mi) —flooyy ® 
0 


combined with the identity /(—1/z) = z7“f(z). Hecke also proved a con- 
verse: every Dirichlet seties ® which satisfies a functional equation of this 
type, and some regularity and growth hypothesis, comes from a modular 
form f of weight 2k; moreover, f is a normalized eigenfunction of the 7(n) 
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if and only if ¢ is an Eulerian product of type (82). See for more details 
E. Hecke, Math. Werke, n° 33, and A. WeiL, Math. Annalen, 168, 1967. 


5.5. Examples 


a) Eisenstein series.—Let k be an integer 22. 


Proposition 13.—The Eisenstein series G, is an eigenfunction of T(n); 
the corresponding eigenvalue is o,,-— ,(n) and the normalized eigenfunction is 


oO 


(86) (1 By t E, = (=I if + Y oa-sn)q" 


The corresponding Dirichlet series is €(s)f(s—2k +1). 
We prove first that G, 1s an eigenfunction of T(n); it suffices to do this for 
7(p), p prime. Consider G, as a function on the set & of lattices of C; we have: 
G(T) = >" 1/y**, cf. n° 2.3, 


yer 


T(p)G(T) = dD y™. 


(T: P’)=p yer’ 


and 


Let ye I. If ye pl then y belongs to each of the p+1 sublattices of [ of 

index p; its contribution in T(p)G,(T) is (p+1)/y**. If ye T—pl’, then y 

belongs to only one sublattice of index p and its contribution is 1/y?*. Thus 
T(p)G AT) = GAT)+p > I/y** = GT) +pG,(pl) 


yepl 
= (1+p'~*)G,(P), 


which proves that G, (viewed as a function on @) is an eigenfunction of 
T(p) with eigenvalue 1+p'~?*; viewed as a modular form, G, is thus an 
eigenfunction of '(p) with eigenvalue p?*~1(1+p!~?*) = o,,_,(p). Formulas 
(34) and (35) of n° 4.2 show that the normalized eigenfunction associated with 
G,, 1S 


“I + +) 724-1 (1)q". 


This also shows that the vcenvalue of 7(n) are o3,_ ,(n). Finally 


y o2,-,(n)/n' = Yo ar*~'/atd’ 
n= 1 a 


(A MAYA Ne) 


| = {(s)(s—2k+1). 
b) The A function 


Proposition 14.—The A function is an eigenfunction of T(n). The corre- 
sponding eigenvalue is r(n) and the normalized eigenfunction is 


Qn)" 7A = 9 TT -ay* = ¥ rina 
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This is clear, since the space of cusp forms of weight 12 1s of dimension 
1, and is stable by the 7(n). 


Corollary.— We have 

(52) t(nm) = 7(n)r(m) if(n, m) = 1, 

(53) r(p)t(p") = r(p"*')+p''r(p""') if p isa prime,n 2 1. 
This follows from cor. 2 of th. 7. 


Remark.—There are similar results when the space M? of cusp forms 
of weight 2k has dimension 1; this happens for 


k = 6,8, 9, 10, 11, 13 with basis A, AG,, AG,, AG,, AG,, and AG,. 
5.6. Complements 


5.6.1. The Petersson scalar product. 
Let f, g be two cusp forms of weight 2k with k > 0. One proves easily 
that the measure 


u(fig) = f(z)g(z)\y**dxdy/y? (x = R(z), y = Im(z)) 


is invariant by G and that it is a bounded measure on the quotient space H/G. 
By putting 


(87) (f.8> = | whe) = | f@e@)y*"7dxdy, 
D 


H/[G 


we obtain a hermitian scalar product on M? which is positive and non- 
degenerate. One can check that 


(88) (T(n)f, 8> = <f, T(n)g), 


which means that the 7(n) are hermitian operators with respect to <f, g>. 
Since the 7(n) commute with each other, a well known argument shows that 
there exists an orthogonal basis of Mf made of eigenvectors of T(n) and that 
the eigenvalues of 7(n) are real numbers. 


5.6.2. Integrality properties. 
Let M,(Z) be the set of modular forms 


f= y c(n)q" 


of weight 2k whose coefficients c(n) are integers. One can prove that there 
exists a Z-basis of M,(Z) which is a C-basis of M,. [More precisely, one can 
check that M,(Z) has the following basis (recall that F = q Il (1 —g")?*): 
k even: One takes the monomials E%F* where a, Be N, and «+36 = k/2; 
k odd: One takes the monomials E,E3F* where a, Be N, and «+38 = 
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(k — 3)/2.] Proposition 12 shows that M,(Z) is stable under T(n), n 2 1. We 


conclude from this that the coefficients of the characteristic polynomial of T(n), 
acting on M,, are integers'; in particular the eigenvalues of the 7(n) are 


algebraic integers (“totally real’, by 5.6.1). 


5.6.3. The Ramanujan-Petersson conjecture. 
Let f= 2 c(n)q", c(l) = 1, be a cusp form of weight 2k which is a 


normalized eigenfunction of the 7(w). 
Let ®, (7) = 1—c(p)T+p**"'T’, p prime, be the polynomial defined in 
n° 5.4, formula (83). We can write 


(89) D, (7) = (l-«,T)(l—«,7) 
with 
(90) Xpta, = C(p), xx, = peo! 


The Petersson conjecture is that «, and «, are complex conjugate. One can 
also express it by: 
leo] = lop! = ph *”, 
or 
le(p)| S 2p*'”", 
or 
Ic(n)| S$ n*~'!?aQ(n) — foralln = I. 


For k = 6, this is the Ramanujan conjecture: |7(p)| < 2p'!!?. 


These conjectures have been proved in 1973 by P. Deligne (Publ. Math. 
1.H.E.S. 1°43, p. 302), as consequences of the ‘Weil conjectures” about 
algebraic varieties over finite fields. 


$6. Theta functions 
6.1. The Poisson formula 


Let V be a real vector space of finite dimension n endowed with an 
invariant measure ». Let V' be the dual of V. Let f be a rapidly decreasing 
smooth function on V (see, L. SCHWARTZ, Théorie des Distributions, chap. 
VII, §3). The Fourier transform f’ of fis defined by the formula 


(91) £0) = [eo F(x)u(2). 
V 
This is a rapidly decreasing smooth function on V’. 
Let now [ be a /attice in V’ (see n° 2.2). We denote by I’ the lattice 
in V’ dualto T; it is the set of y e V’ such that <x, y> e Z for all x e P. One 


|) We point out that there exists an explicit formula giving the trace of T(m), cf. 
M. EICHLER and A. SeLBerG, Journ. Indian Math. Soc., 20, 1956. 
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checks easily that ['’ may be identified with the Z-dual of [ (hence the 
terminology). 


Proposition 15.—Let v = p(V/T). One has: 
(92) ¥ fe) == Y FO) 


xel" 
After replacing » by v~'u, we can assume that u(V/l') = 1. By taking a 
basis e,,...,e, of I, we identify V with R’, F with Z", and » with the 
product measure dx, ...dx,. Thus we have V’ = R’, I’ = Z" and we are 
reduced to the classical Poisson formula (SCHWARTZ, /oc. cit., formule (VII, 
7:5)). 


6.2. Application to quadratic forms 


We suppose henceforth that V is endowed with a symmetric bilinear 
form x.y which is positive and nondegenerate (i.e. x.x > 0 if x +0). We 
identify V with V’ by means of this bilinear form. The lattice I’’ becomes 
thus a /Jattice in V; one has y € I’ if and only if x.y e Z for all x eT. 

To a lattice [', we associate the following function defined on R*: 
(93) O(N = Yet, 

xel 

We choose the invariant measure » on V such that, if ¢,,...¢, iS an 
orthonormal basis of V, the unit cube defined by the ¢,; has volume 1. The 
volume v of the lattice I’ is then defined by v = u(V/T), cf. n° 6.1. 


Proposition 16.—We have the identity 
(94) O-(t) = 1°"707'0,-(t7'). 


Let f=e ™*. It is a rapidly decreasing smooth function on V. The 
Fourier transform f’ of f is equal to f. Indeed, choose an orthonormal basis 
of V and use this basis to identify V with R”; the measure » becomes the 
measure dx = dx,...dx, and the function / is 


f _ e~napt-: +xA), 


2: —nx2 


We are thus reduced to showing that the Fourier transform of e~** ise ™*’, 
which is well known. 

We now apply prop. 15 to the function f and to the lattice t!/?T'; the 
volume of this lattice is t’/*v and its dual is ¢~ '/?I"’; hence we get the formula 
to be proved. 


6.3. Matrix interpretation 


Let e,,...,¢, be a basis of I. Put a;; = e;.e;. The matrix A = (a;,) is 
positive, nondegenerate and symmetric. If x = Xx;e; is an element of V, then 


XX = La; jX{X;. 


108 Modular forms 


The function ©, can be written 


(95) O,(t) = Yi en MBaunr, 


xjeZ 


The volume v of I is given by: 


(96) v = det(A)'/, 
This can be seen as follows: Let €,,...,¢, be an orthonormal basis of V 
and put 


E= E;A...AE, C= OYA... NC» 


We have e = Ae with |A| = v. Moreover, e.e = det(A) ¢.e, and by comparing, 
we obtain v? = det(A). 

Let B = (6;;) be the matrix inverse to A. One checks immediately that 
the dual basis (e;) to (e;) is given by the formulas: 


The (e;) form a basis of [’. The matrix (e;.e;) is equal to B. This shows in 
particular that if v’ = n(V/T’), then we have vv’ = 1. 


6.4. Special case 


We will be interested in pairs (V, I) which have the following two 
properties: 


(i) The dual T’ of T is equal to Y. 

This amounts to saying that one has x.y « Z for x, ye IT and that the 
form x.y defines an isomorphism of T onto its dual. In matrix terms, it means 
that the matrix A = (e;.e,) has integer coefficients and that its determinant 
equals 1. By (96) the last condition is equivalent to v = 1. 

If n = dim V, this condition implies that the quadratic module I belongs 
to the category S, defined in n° 1.1 of chap. V. Conversely, if I’ eS, is 
positive definite, and if one puts V = [ @ R, the pair (V, I) satisfies (1). 


(ii) We have x.x = 0 (mod 2) for all x eT. 

This means that I is of type II, in the sense of chap. V, n° 1.3.5, or else 
that the diagonal terms e;.e; of the matrix A are even. 

We have given in chap. V some examples of such lattices I’. 


6.5. Theta functions 


In this section and the next one, we assume that the pair (V, I") satisfies 
conditions (i) and (i1) of the preceding section. 

Let m be an integer 20, and denote by r,(m) the number of elements 
x of I such that x.x = 2m. It is easy to see that r-(m) is bounded by a 
polynomial in m (a crude volume argument gives for instance r,-(m) = 
O(m"/?)). This shows that the series with integer coefficients 
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2, rr(m)q” = ] +r-(lqt. oe 


converges for |g| < 1. Thus one can define a function @, on the half plane H 
by the formula 


(97) 6(z) = y r-(m)q™ (whereg = e2niz) 
m=0 
We have: 
(98) O(z) = Vg? = J, etitlen, 
xel’ xel’ 


The function 6, is called the theta function of the quadratic module I’. It is 
holomorphic on H. 


Theorem 8.—(a) The dimension n of V is divisible by 8. 
(b) The function 6, is a modular form of weight n/2. 


Assertion (a) has already been proved (chap. V. n° 2.1, cor. 2 to th. 2). 
We prove the identity 


(99) 6-(—-1/z) = (iz)"!?8,(z). 


Since the two sides are analytic in z, it suffices to prove this formula when 
z = it with t real >0. We have 


6fit)= Yet” = O-(2). 
xel 


Similarly, 6,-(—1/it) = ©,(t~'). Formula (99) results thus from (94), taking 
into account that v = landT =I’. 
Since n is divisible by 8, we can rewrite (99) in the form 


(100) 8,-(—1/z) = 2"/78,(z) 
which shows that 9, is a modular form of weight n/2. 

[We indicate briefly another proof of (a). Suppose that 7 is not divisible 
by 8; replacing I, if necessary, by T@®T or T@Ir@L QT, we may 
suppose that n = 4 (mod 8). Formula (99) can then be written 

O(—1/z) = (—1)"42"76,(z) = —2"? 6, (2). 


If we put w(z) = 0,(z)dz"’*, we see that the differential form w is transformed 
into —w by S:z++ —1/z. Since w is invariant by T:z-> z+1, we see that ST 
transforms w into —w, which is absurd because (ST)* = 1.] 


Corollary 1.—There exists a cusp form fr of weight n/2 such that 
(101) 6. = E, +hr where k = n/4. 
This follows from the fact that 6;(00) = 1, hence that 0,—&, is a cusp 


form. 


Corollary 2.— We have r-(m) = Fe O>,-,(m)+O(m") where k = n/4. 


k 
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This follows from cor. 1, formula (34) and th. 5. 


Remark.—The “error term” /; is in general not zero. However Siegel 
has proved that the weighted mean of the fr is zero. More precisely, let C, 
be the set of classes (up to isomorphism) of lattices [’ verifying (i) and (ii) 
and denote by g, the order of the automorphism group of I € C, (cf. chap. V, 
n° 2,3). One has: 


| l 
(102) > —.fr=0 
reC, Or 
or equivalently 
(103) y 1 6,=M,E, whereM,= ¥ —. 
TeCn gr rec, gr 


Note that this is also equivalent to saying that the weighted mean of the 0, 


is an eigenfunction of the T(n). 
For a proof of formulas (102) and (103), see C. L. SIEGEL, Gesam. Abh., 
n° 20. 


6.6. Examples 


1) The case n = 8. 
Every cusp form of weight n/2 = 4 ts zero. Cor. | of th. 8 then shows 


that 0, = E,, in other words: 
(104) r-(m) = 2400;(m) for all integers m 2 1. 


This applies to the lattice I, constructed in chap. V, n° 1.4.3 (note that this 
lattice is the only element of C;,). 


ll) The casen = 16. 
For the same reason as above, we have: 


(105) 0, = Ey = 1+480 x o(m)q”. 


Here one may take = [, @ I, or ' = Ty, (with the notations of chap. V, 
n° 1.4.3); even though these two lattices are not isomorphic, they have the 
same theta function, 1.e. they represent each integer the same number of 
times. 

Note that the function 6 attached to the lattice I’, @ I’, is the square of 
the function 8 of I',; we recover thus the identity: 


(1 +240 y oma") = 1+480 y o4(m)q™. 


il) The case n = 24. 
The space of modular forms of weight 12 is of dimension 2. It has for 
basis the two functions: 


- y o1,(m)q", 


5520 
E, = 1 + 
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ie 8) oD 
F = (20) '*A =q [| (1-9")** = ¥ x(m)q™. 
m=1 m= i 


The theta function associated with the lattice [ can thus be written 


(106) 6-=EgterF withc,€Q. 
We have 
(107) re(on) = 2° oy (0m) +era(mm) for m & 1. 


The coefficient c, is determined by putting m = 1: 


65520 
08 = 1) — ——.. 
(108) Cr =rr(l) 691 


Note that it is +0 since 65520/691 is not an integer. 


Examples. 
a) The lattice [ constructed by J. LEECH (Canad. J. Math., 16, 1964) is such 
that r-(1) = 0. Hence: 


Ccr=- 65520 = —2*,37.5.7.13/691. 
691 
b) For f =T, @®T, @ I's, we have r-(1) = 3.240, hence: 
Cr = 432000 _ 273753/69). 
691 
c) For [ = P54, we have r-(1) = 2.24.23, hence: 
op = 21844 _ 4103 997/691. 
691 


6.7. Complements 


The fact that we consider only the full modular group G = PSL,(Z), 
forced us to limit ourselves to lattices verifying the very restrictive conditions 
of n° 6.4. In particular, we have not treated the most natural case, that of the 
quadratic forms 

xP... 427, 

which verify (i) but not (ii). The corresponding theta functions are ‘‘modular 
forms of weight n/2” (note that m/2 is not necessarily an integer) with respect 
to the subgroup of G generated by S and 7’. This group has index 3 in G, 
and its fundamental domain has two “cusps” to which correspond two 
types of “Eisenstein series”; using them, one obtains formulas giving the 
number of representations of an integer as a sum of nm squares; for more 
details, see the books and papers quoted in the bibliography. 
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